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OM protein - protein search, using sw model 

Run on: January 30, 2004, 10:50:47 ; Search time 71.1206 Seconds 

(without alignments) 

1073.493 Million cell updates/se 



Title: US- 09-9 8 9-4 8 1-3 

Perfect score : 252 7 

Sequence: 1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDSIVQG 481 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1107863 seqs, 158726573 residues 

Total number of hits satisfying chosen parameters: 1107863 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : A_Geneseq_19Jun03: * 

1: / SIDSl/gcgdat a/genes eq/ geneseqp-embl/AA19 8 0 . DAT : * 
2 : / SIDSl/gcgdat a/ genes eq/ geneseqp-embl/AAl98 1 . DAT : * 
3 : /siDSl/gcgdata/geneseq/geneseqp-embl/AAl982 . DAT : * 
4 : / SIDSl/ gcgdata/ gene seq/ geneseqp-embl/AAl98 3 . DAT : * 
5 ; /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl984 .DAT: * 
6: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1985 . DAT: * 
7 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl98 6 . DAT : * 
8 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl987 . DAT : * 
9 : /siDSl/gcgdata/geneseq/geneseqp-embl/AAl988 . DAT: * 
10 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl98 9 . DAT : * 
11 : / SIDSl/ gcgdata/ genes eq/ geneseqp— embl/AA19 9 0 . DAT : * 
12 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl991 . DAT: * 
13 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl992 . DAT: * 
14 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl993 . DAT: * 
15 : /siDSl/gcgdata/geneseq/geneseqp-embl/AAl994 . DAT : * 
16 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl99 5 . DAT : * 
17 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1996 . DAT : * 

18 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl997 .DAT: * 

19 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl99 8 . DAT: * 
20 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl999 . DAT : * 
21 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2 000 . DAT: * 
22 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2001 . DAT: * 
23 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2 0 02 . DAT: * 
24 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 . DAT : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed 



and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAW59045 

ID AAW59045 standard; Protein; 481 AA. 
XX 

AC AAW5 9 04 5; 
XX 

DT ll-AUG-1998 (first entry) 
XX 

DE Human MNTF1-F3 protein fragment. 
XX 

KW Motorneuronotrophic factor; MNTF-1; MNTF1-F3; human; axon regeneration; 
KW motoneurone; diagnose; treatment; disease; wound healing; scar tissue; 
KW keloid. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/ Qualifiers 

FT Misc-dif f erence 194 

FT /note= "encoded by TTC" 

FT Misc-dif f erence 248 

FX /note= "encoded by AGC" 

FT Misc-dif f erence 461 

FT /note= "encoded by ACY" 

XX 

XX 

PN W09813492-A2 . 
XX 

PD 02-APR-1998. 
XX 

PF 22-SEP-1997; 97WO-US 17 142 . 
XX 

PR 12-SEP-1997; 97US-092 8 8 62 . 
PR 27-SEP-1996; 96US-0026792 . 
PR 15-NOV-1996; 96US-0751225 . 
XX 

PA (KMBI-) KM BIOTECH INC. 
XX 

PI Chau RMW; 
XX 

DR WPI; 1998-230703/20. 
DR N-PSDB; AAV11747. 
XX 

PT Motoneurotrophic factor MNTF1-F3 and MNTF1-F6 - useful for 

PT motoneuron regeneration, diagnosing or treating motoneuron disease 

PT and to accelerate wound healing without scar formation 

XX 

PS Claim 3; Fig 2A; 78pp; English. 
XX 

CC This sequence represents a novel human motoneurotrophic factor, MNTF1 F 

CC Such factors are used to promote regeneration of the axon of a 

CC motoneurone, to diagnose and treat motoneurone disease in a mammal or t 

CC accelerate wound healing whilst concomitantly minimising or inhibiting 

CC scar tissue and/or keloid formation in an area associated with a wound. 

CC For promoting axonal regeneration, the polypeptide is administered at a 

CC concentration of 5 ng-50 mg, whereas for inhibiting hereditary 

CC motoneurone disease, the dosage is 5-100 (especially 30-50) ng per kg 

CC body weight. 



XX 

SQ Sequence 481 AA; 



Query 


Match 


99.4%; Score 2512; DB 19; Length 481; 




Best 


Local Similarity 99.4%; Pred. No. 1.8e-235; 


0; 


Matches 478 


; Conservative 1; Mismatches 2; Indels 0; Gaps 
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RESULT 2 
AAU20459 

ID AAU20459 standard; Protein; 729 AA. 
XX 

AC AAU20459; 
XX 

DT 06-DEC-2001 (first entry) 
XX 

DE Human secreted protein, Seq ID No 451. 
XX 
KW 
KW 
KW 



Immunomodulatory; human immunodeficiency virus; HIV; anaemia; angina; 
rheumatoid arthritis; antiarteriosclerotic ; cardiant; vascular; 
cerebroprotective ; thrombolytic; antimicrobial ; ophthalmological ; 



KW cytostatic; Alzheimer's disease; Parkinson's disease; human; cancer; 

KW multiple sclerosis; cancer; hyperprolif erati ve disorder; infection; 

KW Gaucher's disease; neurological disease; cerebrovascular disorder; 

KW thrombosis; wound healing. 
XX 

OS Homo sapiens. 
XX 

PN WO200155326-A2 . 
XX 

PD 02-AUG-2001. 
XX 

PF 17-JAN-2001; 2001WO-US01347 . 
XX 

PR 31-JAN-2000; 2000US-0179065 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 

XX 

DR WPI; 2001-451931/48. 

DR N-PSDB; AAS33168. 



XX 
PT 



New nucleic acids and polypeptides, useful for diagnosing, preventing 

PT or treating medical conditions - 
XX 

PS Claim 11; SEQ ID No 451; 753pp; English. 
XX 

CC The invention relates to novel isolated nucleic acid molecules (I) 

CC encoding human secreted proteins (II). (I) and (II) are used to prevent, 

CC treat or ameliorate a medical condition in e.g. humans, mice, rabbits, 

CC goats, horses, cats, dogs, chickens or sheep. (I) and (II) may be used m 

CC the prevention, treatment and diagnosis of diseases associated with 

CC inappropriate expression of secreted proteins. (I) and complementary 

CC sequences may also be used as DNA probes in diagnostic assays (e.g. 

CC polymerase chain reactions (PCR)) to detect and quantitate the presence 

CC of similar nucleic acid sequences in samples, and so which patients may 

CC be in need of restorative therapy. (II) may also be used as antigens m 

CC the production of antibodies and in assays to identify modulators 

CC (agonists and antagonists) of the expression and activity of the secreted 

CC proteins. The anti-(II) antibodies and antagonists may also be used to 

CC down regulate expression and activity of (II). The anti-(II) antibodies 

CC may also be used as diagnostic agents for detecting the presence of (II) 

CC in samples (e.g. by enzyme linked immunosorbant assay (ELISA) ) . The 

CC disorders include for example: immune/ autoimmune diseases (e.g. HIV 

CC (human immunodeficiency virus) infections, anaemia, rheumatoid arthritis 

CC and multiple sclerosis), cancers and hyperprolif erative disorders (e.g. 

CC melanomas, neoplasms of the breast or liver, Sezary syndrome and 

CC Gaucher's disease), neurological diseases (e.g. Alzheimer's disease, 

CC Parkinson's disease and Charcot-Marie-Tooth disease), cardio-/ 

CC cerebrovascular disorders (e.g. cardiac arrest, tachycardia, 

CC angina and thrombosis), infections caused by bacteria, viruses and 

CC fungi and ocular disorders (e.g. corneal infections). (I) and (II), 

CC agonists, antagonists and antibodies can also be used to promote wound 

CC healing, maintain organs before transplantation, and support cell culture 

CC of primary tissues. AAU20342-AAU20666 represent human secreted protein 

CC amino acid sequences, and related sequences of the invention. 

CC Note: The sequence data for this patent did not appear in the printed 



CC specification but was obtained in electronic format directly from WIPO 

CC at : ftp . wipo . int/pub/published_pct_sequences . 

XX 

SQ Sequence 72 9 AA; 

Query Match 98.3%; Score 2485; DB 22; Length 729; 

Best Local Similarity 99.2%; Pred. No. 1.5e-232; 

Matches 474; Conservative 1; Mismatches 3; Indels 0; Gaps 0; 

Qv 4 ySTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNP 63 

| | | | | | | | | I I I I I I 1 I II I I I E M I i I 1 I I ! 1 M i I I I I I I I I I I 

Db 49 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNP 108 

Qy 64 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 123 

M I 1 I I I I I I I I I I I I I I I I I I I I I IE 1 I I I I I I 1 I M I I 1 I I I I I I I I I I I I 

Db 109 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTXEELKLN 168 

Qv 12 4 IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 183 

| | | | M I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I MINIMI 

Db 169 XRPLLRLVCKKFFGEFTGFVDMCVQHI PSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 228 

Qv i 8 4 GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 243 

MIM Ml I M I I I I I I I I II I I I M I I M I I I I I I I I 

Db 22 9 GPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICT VGRLW 288 

Qv 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

| | | | 1 | | M II 1 I II M I I I I I I M I I I II I M I I I I I I M I I I I I I M I I I I I I I I I I I 
Db 289 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 348 

Qv 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 363 

| | | | 1 | | | | 1 M I II M I I I II I II I II I I I I I I II I M I M II I I I I I 1 I I I I I 

Db 349 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 408 

Qv 364 DIKVADP VVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNR 423 

| | | | || I I I M I I 1 I M I I M I I I I I I I M I M I 1 I 1 I I I I I I I 1 II 1 II I I I I I 

Db 409 DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGI^DIENEWQITWNR 468 

Qy 424 KKLGEFFQTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP SEVDKALLGSVKDS I VQG 481 

| | | | | | | | | || | | 1 M M I I II II I I M I I I I I I M 1 I I I I M I I I I II I I II I M M 
Db 469 KKLGEFFQTKYDWDLL7VARS IWAFGPDATGPNI LVDDTLP SEVDK7\LLGSVKD SI VQG 526 



RESULT 3 
AAU20616 

ID AAU20616 standard; Protein; 729 AA. 
XX 

AC AAU20616; 
XX 

DT 06-DEC-2001 (first entry) 
XX 

DE Human secreted protein, Seq ID No 608. 

XX 

KW Immunomodulatory; human immunodeficiency virus; HIV; anaemia; angina; 

KW rheumatoid arthritis; antiarteriosclerotic; cardiant; vascular; 

KW cerebroprotective; thrombolytic; antimicrobial; ophthalmological ; 

KW cytostatic; Alzheimer's disease; Parkinson's disease; human; cancer; 

KW multiple sclerosis; cancer; hyperprolif erative disorder; infection; 



KW Gaucher' s disease; neurological disease; cerebrovascular disorder; 

KW thrombosis; wound healing. 

XX 

OS Homo sapiens . 
XX 

PN WO200155326-A2 . 
XX 

PD 02-AUG-2001. 

XX 

PF 17-JAN-2001; 2 001WO-US01347 . 
XX 

PR 31-JAN-2000; 20 0 OUS-017 9065 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Barash SC, Ruben SM; 
XX 

DR WPI; 2001-451931/48. 

DR N-PSDB; AAS33325. 
XX 

PT New nucleic acids and polypeptides , useful for diagnosing, preventing 

PT or treating medical conditions - 

XX 

PS Claim 11; SEQ ID No 608; 753pp; English. 
XX 

CC The invention relates to novel isolated nucleic acid molecules (I) 

CC encoding human secreted proteins (II). (I) and (II) are used to prevent, 

CC treat or ameliorate a medical condition in e.g. humans, mice, rabbits, 

CC goats, horses, cats, dogs, chickens or sheep. (I) and (II) may be used i 

CC the prevention, treatment and diagnosis of diseases associated with 

CC inappropriate expression of secreted proteins. (I) and complementary 

CC sequences may also be used as DNA probes in diagnostic assays (e.g. 

CC polymerase chain reactions (PCR) ) to detect and quantitate the presence 

CC of similar nucleic acid sequences in samples, and so which patients may 

CC be in need of restorative therapy. (II) may also be used as antigens in 

CC the production of antibodies and in assays to identify modulators 

CC (agonists and antagonists) of the expression and activity of the secrete 

CC proteins. The anti-(II) antibodies and antagonists may also be used to 

CC down regulate expression and activity of (II). The anti-(II) antibodies 

CC may also be used as diagnostic agents for detecting the presence of (II) 

CC in samples (e.g. by enzyme linked immunosorbant assay (ELISA) ) . The 

CC disorders include for example: immune/ autoimmune diseases (e.g. HIV 

CC (human immunodeficiency virus) infections, anaemia, rheumatoid arthritis 

CC and multiple sclerosis), cancers and hyperprolif erative disorders (e.g. 

CC melanomas, neoplasms of the breast or liver, Sezary syndrome and 

CC Gaucher's disease), neurological diseases (e.g. Alzheimer's disease, 

CC Parkinson's disease and Charcot-Marie-Tooth disease), cardio-/ 

CC cerebrovascular disorders (e.g. cardiac arrest, tachycardia, 

CC angina and thrombosis), infections caused by bacteria, viruses and 

CC fungi and ocular disorders (e.g. corneal infections). (I) and (II), 

CC agonists, antagonists and antibodies can also be used to promote wound 

CC healing, maintain organs before transplantation, and support cell cultui 

CC of primary tissues. AAU2 034 2-AAU2 0 666 represent human secreted protein 

CC amino acid sequences, and related sequences of the invention. 

CC Note: The sequence data for this patent did not appear in the printed 

CC specification but was obtained in electronic format directly from WIPO 

CC at : ftp . wipo . int/pub/published_pct_sequences . 



XX 

SQ Sequence 729 AA; 



Query Match 98.3%; Score 2485; DB 22; Length 729; 

Best Local Similarity 99.2%; Pred. No. 1.5e-232; 

Matches 474; Conservative 1; Mismatches 3; Indels 0; Gaps 0; 

Qy 4 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNP 63 

| | M I I It M I I I I I I I I I I I I I I I I till I I I I I I I I I I M 

Db 49 YSTDENLILS PLLGNVCFSSSQYS I CFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNP 108 

Qv 64 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 123 

| | | | I | | 1 I I | I I I I I I I I I I I I II I M I I I I I I I I I I II I M M I I i 1 

Db 109 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTXEELKLN 168 

Qy 124 IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 183 

M I I I I I 1 I I M I I I I I I M I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I 
Db 169 IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 228 

Qy 184 GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 243 

| Illhll I I I I I I II M 11 I I I 1 I I I I I I I 1 M t I I I I I M 

Db 22 9 GPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 288 

Qv 244 ISV7VRYHIEWRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

M 1 I I i I I 1 II I I I I I I I M I I I I I I I M I I 1 I I I 1 1 I I I I I I M I II M I I 1 I I 

Db 28 9 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 34 8 

Qy 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 363 

I t I 1 1 I I I I 1 M I t t I I 1 I M I I I 1.1 I 1 I I I I 1 I I I I I I I 1 I I I M 

Db 349 VEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 408 

Oy 364 DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNR 423 

| | | | | I I I M 1 1 I I I I I I I I I I i I I I I I I M 1 I 1 I I I I I I I I I I I M I I I I I M I 1 I I I 1 
Db 409 DIKV7VDPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLMDIENEWQITWNR 468 

Qy 424 KKLGEFFQT KYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

| | | M | 1 | | | I I 1 I 1 I I I I I I I I I I I I 11 I I I I I I M 1 II I I I M I I I I 1 I i I I I I I I 
Db 469 KKLGEFFQT KYDWDLLAARS IWAFGPDATGPNI LVDDTLPS EVDKALLGSVKDS I VQG 52 6 



RESULT 4 
ABB61030 

ID ABB61030 standard; Protein; 975 AA. 
XX 

AC ABB61030; 
XX 

DT 26-MAR-2002 (first entry) 
XX 

DE Drosophila melanogaster polypeptide SEQ ID NO 9882. 
XX 

KW Drosophila; developmental biology; cell signalling; insecticide; 

KW pharmaceutical. 

XX 

OS Drosophila melanogaster. 
XX 

PN WO200171042-A2. 
XX 



PD 27-SEP-2001, 
XX 

PF 23-MAR-2001; 2001WO-US09231 . 
XX 

PR 23-MAR-2000; 2 00 0US-191637P . 

PR ll-JUL-2000; 2000US-0614150 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL05133. 
XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signalling and cell-cell 

PT interactions - 
XX 

PS Disclosure; SEQ ID NO 9882; 21pp + Sequence Listing; English. 



The invention relates to an isolated nucleic acid detection reagent 



XX 

cc 

CC capable of detecting 1000 or more genes from Drosophila. The invention is 

CC useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences ( ABLl 617 6-ABL30511 ) , expressed DNA 

CC sequences ( ABL0184 0-ABL1617 5 ) and the encoded proteins 

CC (ABB57737-ABB72072) . 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 975 AA; 

Query Match 73.1%; Score 1848; DB 22; Length 975; 

Best Local Similarity 72.4%; Pred. No. 2.8e-170; 

Matches 347; Conservative 57; Mismatches 75; Indels 0; Gaps 0; 



Qy 


3 


YYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTEGDINYQEFAKRLWGDIYFN 

| : | : | | : : | | : hi! 1 Mi! 1 1 1 1 : 1 1 1 1 : : I = ! 1 1 ! ! 1 M : 1 1 1 

YGAPDDNLLVSPILGNVCFASSLYGFCFTLKSFAKLYADTYEGVAYLDFAKRLWGDMYFN 


62 


Db 


294 


353 


Qy 


63 


PKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 

11M|:|| I : I : I 1 1 1 ! 1 1 1 1 1 1 : 1 1 : : 1 1 1 1 1 1 M ! : 1 ' ' M : : : 1 1 1 : 1 
SKTRKFSKKQPHNSAQRSFVEFILEPMYKLIAQWGDVDTTLSDTLAELNVRVSKEEMKS 


122 


Db 


354 


413 


Qy 


123 


NIRPLLRLVCKKFFGEFTGFVDMCVQHI PSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDP 

| | | | 1 I I I I 1 : 1 1 ::! 1 1 1 1 1 1 : 1 i 1 1 ! 1 1 : : 1 1 M : 1 : 1 1 : 
NIRPLLRLVCNRFMGDCSGFVDMCVEHIKSPLENAKRKVDHI YTGPKEGDIYRDMISCNQ 


182 


Db 


414 


473 


Qy 


183 


DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQ PVKVLGENYTLEDEEDSQICTVGRL 
| \ | | : : 1 | : 1 1 I : : 1 1 1 : i M 1 1 : I 1 1 M 1 1 ! : 1 1 1 1 1 : 1 1 1 1 1 
YGTLMVH S S KMYPNDDCT FFQVLARI VS GT LHAGQEVRVLGENYTLQDEEDS RI LQVGRL 


242 


Db 


474 


533 


Qy 


243 


WISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 

| : : j | : 1 : 1 1 1 1 M 1 1 1 1 1 1 1 : 1 1 1 1 1 t : 1 1 : 1 : 1 1 1 1 1 1 1 1 1 i : 1 1 1 
WVFESRYKVELNRVPAGNWVLIEGIDQCIVKTSTIVDINVPEDLYI FRPLKFNTQS I IKI 


302 


Db 


534 


593 



Qy 303 AVEPWPSELPmLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 362 

M | | | M I I I i I I I t I I I I I t I II i I : I : I I I I II I I I ! I I I I I I I I I I I 

Db 594 AVEPVNPSELPKMLDGLRKVNKSYPLLSTRVEESGEHVILGTGELYLDCVMHDLRKMYSE 653 

Qy 363 IDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

| | | M I I 1 I I I I I I I I I I I I I I I II I : I I I I I I I I I I I 

Db 654 I D I KVAD PWAFC ET WET S S LKC FAET PN KKN K I TMI S E P LEKGLAED I ENGT VC I NWN 713 

Qy 423 RKKLGEFFQTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP S EVDKALLGSVKDS I VQG 481 

: | : : | E | M | I [ I I I I I I I I I I I I I I : I I I I I I I M I 1 I II : I I I I I I M 

Db 714 KKRIGEFFQVNYDWDLLAARSIWAFGPDSTGPNILVDDTLPSEVDKNLLTAVKDSIVQG 772 



RESULT 5 
AAM79209 

ID AAM79209 standard; Protein; 858 AA. 
XX 

AC AAM7 92 09; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human protein SEQ ID NO 1871. 
XX 

KW Human; cytokine; cell proliferation; cell differentiation; gene therapy; 

KW vaccine; peptide therapy; stem cell growth factor; haematopoiesis ; 

KW tissue growth factor; immunomodulatory; cancer; leukaemia; 

KW nervous system disorder; arthritis; inflammation. 

XX 

OS Homo sapiens . 
XX 

PN WO200157190-A2 . 
XX 

PD 09-AUG-2001 . 
XX 

PF 05-FEB-2001; 2 001WO-US04 098 . 
XX 

PR 03-FEB-2000; 2 000US-04 96914 . 

PR 27-APR-2000; 2 000US-0560875 . 

PR 20-JUN-2000; 2 OOOUS-0598 07 5 . 

PR 19-JUL-2000; 2000US-0620325 . 

PR 01-SEP-2000; 2 OOOUS-0 654 936 . 

PR 15-SEP-2000; 2 000US-0663561 . 

PR 20-OCT-2000; 2 000US- 0693325 . 

PR 30-NOV-2000; 2000US-0728422 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT, Asundi V, Zhou P, Xu C, Cao Y, Ma Y; 

PI Zhao QA, Wang D r Wang J, Zhang J, Ren F, Chen R, Wang ZW; 

PI Xue AJ, Yang Y, Wejhrman T, Goodrich R; 
XX 

DR WPI; 2001-476283/51. 

DR N-PSDB; AAK52342. 
XX 

PT Nucleic acids encoding polypeptides with cytokine-like activities, 

PT useful in diagnosis and gene therapy - 



XX 

PS Claim 20; Page 4256-4258; 6221pp; English. 
XX 

CC The invention relates to polynucleotides (AAK514 5 6-AAK53435 ) and the 

CC encoded polypeptides (AAM7 8323-AAM8 03 02 ) that exhibit activity elating to 

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations. The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 

CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 

CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. 

CC Note: Records for SEQ ID NO 2110 (AAK52581), 2111 (AAK52582) and 3666 

CC (AAM80020) are omitted as the relevant pages from the sequence listing 

CC were missing at the time of publication. 
XX 

SQ Sequence 858 AA; 

Query Match 34.4%; Score 870; DB 22; Length 858; 

Best Local Similarity 37.3%; Pred. No. 4.6e-75; 

Matches 182; Conservative 95; Mismatches 189; Indels 22; Gaps 5 



Qy 




WT tt qpT t cwrF^SSDYSTrFTLGSFAKIYADTF GDI NYQEFAKRLW 

I : : : ! : I 1 1 1 1 : IN 1 1 : : 1 1 t : : : 1 : 1 1 
NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 


56 


Db 


202 


261 


Qy 


57 


GDIYFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 

M E | : I | I : I 1 : : | : | : 1 1 : 1 : : 1 : : : : : : : 1 1 
GDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDI 


113 


Db 


262 


321 


Qy 


114 


HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 

| | : : M | : | : : : : 1 I : f 1 1 1 : 1 II 1 = 
KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 


173 


Db 


322 


381 


Qy 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 

: | 1 | 1 1 i 1 : : 1 1 11 : 1 : 1 1 1 ! 1 1 : 1 1 : : : 1 1 1 1 'W 
AMGI KSCDPKGPLMMYI S KMVPT S DKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 


233 


Db 


382 


441 


Qy 


234 


SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 

: : | : : I I : 1 1 II 1 : 1 1 1 1 : I M ! 1 1 It 1 : I 
L Y LK P I Q RT I LMMG RYVE PIE D VP C GN I VGL VG VDQ FL VKT GT ITT FEHAHNMRVMK 


293 


Db 


442 


498 


Qy 


294 


FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 

| : : 1 : : : | | | 1 1 : : 1 1 I : : : 1 1 : : : II 1 : : 1 1 II 1 1 : 1 1 1 1 1 : 1 : • 
FSVS PVVRVAVELAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


499 


558 


Qy 


354 


HDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGIJVEDIE 

|| : : : I \\ : 1 1 1 1 : : III 1 1 : : 1 :: : 1 1 1 1 : : 1 1 1 HUM: 
KDLEEDHACI P I KKSDPWS YRETVSEESNVLCLSKS PNKHNRLYMKARP FPDGLAEDI D 


413 


Db 


559 


618 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 
| | : : 1 1 : 1 1 : 1 I 1 1 M II 1 1 1 1 1 1 1 1 : 1 


473 


Db 


619 


KGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 


674 


Qy 


474 


VKDSIVQG 481 





: I I I : I I 

Db 675 I KD SWAG 682 

RESULT 6 
AAM80193 

ID AAM80193 standard; Protein; 858 AA. 
XX 

AC AAM80193; 
XX 

DT 06-NOV-2001 (first entry) 
XX 

DE Human protein SEQ ID NO 3839. 
XX 

KW Human; cytokine; cell proliferation; cell differentiation; gene therapy; 

KW vaccine; peptide therapy; stem cell growth factor; haematopoiesis ; 

KW tissue growth factor; immunomodulatory; cancer; leukaemia; 

KW nervous system disorder; arthritis; inflammation. 

XX 

OS Homo sapiens. 
XX 

PN WO200157190-A2 . 
XX 

PD 09-AUG-2001. 
XX 

PF 05-FEB-2001; 2001WO-US04098 . 
XX 

PR 03-FEB-2000; 2000US-0496914 . 

PR 27-APR-2000; 2000US-0560875 . 

PR 20-JUM-2000; 2000US-0598075 . 

PR 19-JUL-2000; 2000US-O62O325 . 

PR 01-SEP-2000; 2000US-0654936 . 

PR 15-SEP-2 000; 2000US-0 663561 . 

PR 2 0-OCT-2 000; 2 000US-0 69332 5 . 

PR 30-NOV-2000; 2 OOOUS-072 8422 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT , Liu C, Drmanac RT, Asundi V, Zhou P, Xu C, Cao Y, Ma Y; 

PI Zhao QA, Wang D, Wang J, Zhang J, Ren F, Chen R, Wang ZW; 

PI Xue A J, Yang Y, Wejhrman T, Goodrich R; 
XX 

DR WPI; 2001-476283/51. 

DR N-PSDB; AAK53326. 
XX 

PT Nucleic acids encoding polypeptides with cytokine-like activities, 

PT useful in diagnosis and gene therapy - 

XX 

PS Claim 20; Page 447; 6221pp; English. 
XX 

CC The invention relates to polynucleotides ( AAK51456-AAK53435 ) and the 

CC encoded polypeptides {AAM78323-AAM80302 ) that exhibit activity elating to 

CC cytokine, cell proliferation or cell differentiation or which may induce 

CC production of other cytokines in other cell populations. The 

CC polynucleotides and polypeptides are useful in gene therapy, vaccines or 

CC peptide therapy. The polypeptides have various cytokine-like activities, 

CC e.g. stem cell growth factor activity, haematopoiesis regulating 



CC activity, tissue growth factor activity, immunomodulatory activity and 

CC activin/inhibin activity and may be useful in the diagnosis and/or 

CC treatment of cancer, leukaemia, nervous system disorders, arthritis and 

CC inflammation. 

CC Note: Records for SEQ ID NO 2110 (AAK52581), 2111 (AAK52582) and 3666 

CC (AAM80020) are omitted as the relevant pages from the sequence listing 

CC were missing at the time of publication. 
XX 

SQ Sequence 858 AA; 

Query Match 34.4%; Score 870; DB 22; Length 858; 

Best Local Similarity 37.3%; Pred. No. 4.6e-75; 

Matches 182; Conservative 95; Mismatches 189; Indels 22; Gaps 



Air 

yy 


9 


NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 

| : : : I : I 1 1 1 1 : III 1 1 : : 1 1 1 = - 1 : 1 1 

NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 


56 


LJD 




261 


Qy 


■D I 


nilTYFMPKTRKFTKKAPTSSSO RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 


113 


Db 


262 


I | I | : I I I : 1 I : : | : | : | | : I : : I : : : : : : : 1 1 
GDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDI 


321 


Ov 


114 


HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 


173 


Db 


322 


| | : : | | I : | :: : : I ! : 1 1 1 1 : 1 II 1 : 

KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 


381 


Qy 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 

: I I I II 1 1 :: 1 I II : 1 : 1 1 1 1 1 1 : 1 1 : : : 1 1 1 1 : M 
AMGIKSCDPKGPLMMYI SKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 


Zoo 


Db 


382 


441 


Qy 


234 


S Q I CT VG RLW I S VARYH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E EAQ I F RP L K 
: : | : : M : II 1 I 1 : 1 1 I I : M 1 1 1 1 II 1 : 1 
L Y L K P I Q RT I LMMG R YVE PIE D VP C GN I VGL VG VDQ FL VKT GT ITT FEHAHNMRVMK 


293 


Db 


442 


498 


Qy 


294 


FNTTSVIKIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 

| : : | : : : M 1 I I : : I I I : : : 1 1 : : : 11 1 : : 1 i 1 1 M : 1 1 1 1 1 : 1 : : 

FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


499 


558 


Qy 


354 


HDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 

|| : : : I II : 1 1 1 1 : : 1 1 1 1 1 : : 1 : : : 1 1 1 1 = = 1 1 1 MINI: 
KDLEEDHACI PIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 


413 


Db 


559 


618 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 

| | : : 1 1 : 1 1 : II II INI Itllll 1 1 : 1 
KGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 


473 


Db 


619 


674 


Qy 


474 


VKDSIVQG 4 81 

: 1 1 1 : 1 1 

I KD SWAG 682 




Db 


675 





RESULT 7 
AAP7 00 9 9 

ID AAP70099 standard; Protein; 858 AA. 
XX 

AC AAP70099; 
XX 



DT 03-OCT-2002 (updated) 

DT 18-APR-1991 (first entry) 

XX 

DE Sequence of elongation factor 2 (EF-2) in plasmid pHEW 1. 
XX 

KW Protein biosynthesis; diptheria toxin; Ps aeruginosa toxin; 

KW resistant; selective marker. 

XX 

OS Rattus rattus . 
XX 

PN EP217327-A. 
XX 

PD 08-APR-1987. 
XX 

PF 29-SEP-1986; 8 6EP-011334 9 . 
XX 

PR 30-SEP-1985; 8 5 JP-02 19545 . 
XX 

PA (SUMO ) SUMITOMO CHEM IND KK. 
XX 

PI Uchida T; 
XX 

DR WPI; 1987-095363/14. 

DR N-PSDB; AAN70159. 
XX 

PT New DNA sequence encoding elongation factor 2 - is useful as 

PT selective marker for animal cells transformed with exogenous DNA 

PT from un-trans formed cells 

XX 

PS Disclosure; Fig. 1; 38pp; English. 
XX 

CC EF-2 is esssntial for the translocation of peptidyl- tRNA from A- 

CC site to P-site on the ribosome. It is inactivated through formation 

CC of a covalent bond with ADP ribose derived from NAD by the action of 

CC diptheria toxin or Ps . aeruginosa exotoxin, and so cells exposed to 

CC the toxin may die. A variant EF-2 (claimed) is resistant to diptheria 

CC toxin and Pseudomonas aeruginosa exotoxin. 

CC (Updated on 03-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 858 AA; 

Query Match 34.2%; Score 863; DB 8; Length 858; 
Best Local Similarity 37.1%; Pred. No. 2.2e-74; 

Matches 181; Conservative 95; Mismatches 190; Indels 22; Gaps 5; 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 

i : : : I : 1 1 I I I : It! i 1 : : I I I = : : ' : ' ' 

Db 2 02 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMWAKFAAKGEGQLGPAERAKKVEDMMKKLW 2 61 

Qy 57 GDI YFNPKT RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I | I I : I I I : I I : : I : I : I I : I : : I : : : : : I I 

Db 262 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 321 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

||: :|||: | ::: : I |:||| I : I I I I : 

Db 322 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 381 



Ov 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 

: 1 1 1 1 I 1 1 : : E 1 II : I : I i I 1 1 1 : 1 I : : : I 1 1 1 : 1 : 
AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKEE 


233 


Db 


382 


441 


Ov 


234 


SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 

: : I : : [ I : II II 1 : MM -III Ml II 1 M 
LYLKPIQRTILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITT FEHAHNMRVMK 


293 


Db 


442 


498 


Ov 


294 


FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 

I : : I : : : I M |r::||:i: M 1 : M 1 1 M I M 1 111 = 1= : 

FSVSPWRVAVE7VKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


499 


558 


Qy 


354 


HDLRKMYSEIDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 

M : : : [ 1 1 M 1 1 1 : : 1 M 1 1 : : 1 : : M II 1 : : 1 1 1 M 1 1 M : 
KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 


413 


Db 


559 


618 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 

I I : 1 I :\ 1 : 1 t M 1 II 1 1 1 II 1 1 1 1 ' 1 
KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 


473 


Db 


619 


674 


Qy 


474 


VKDSIVQG 4 81 
M M M 1 
IKDSWAG 682 




Db 


675 





RESULT 8 


AAP70100 


ID 


AAP70100 standard; Protein; 858 AA. 


XX 




AC 


AAP70100; 


XX 




DT 


03-OCT-2 002 (updated) 


DT 


18-APR-1991 (first entry) 


XX 




DE 


Sequence of variant of elongation factor 2 (EF-2), in 


DE 


plasmid pHED 1, resistant to diptheria toxin and Pseudomonas 


DE 


aeruginosa exotoxin. 


XX 




KW 


Protein biosynthesis; diptheria toxin; Ps aeruginosa toxin; 


KW 


resistant; selective marker. 


XX 




OS 


Rattus rattus . 


XX 




PN 


EP217327-A. 


XX 




PD 


08-APR-1987 . 


XX 




PF 


2 9-SEP-198 6; 8 6EP-011334 9 . 


XX 




PR 


30-SEP-198 5; 8 5 JP- 02 19545 . 


XX 




PA 


( SUMO ) SUMITOMO CHEM IND KK. 


XX 




PI 


Uchida T; 


XX 




DR 


WPI; 1987-095363/14. 


DR 


N-PSDB; AAN70160. 



XX 

PT New DNA sequence encoding elongation factor 2 - is useful as 

PT selective marker for animal cells transformed with exogenous DNA 

PT from un-trans formed cells 

XX 

PS Disclosure; Fig 2; 38pp; English. 
XX 

CC EF-2 is esssntial for the translocation of peptidyl- tRNA from A- 

CC site to P-site on the ribosome. It is inactivated through formation 

CC of a covalent bond with ADP ribose derived from NAD by the action of 

CC diptheria toxin or Ps . aeruginosa exotoxin, and so cells exposed to 

CC the toxin may die. A variant EF-2 (claimed) is resistant to diptheria 

CC toxin and Pseudomonas aeruginosa exotoxin. 

CC {Updated on 03-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 858 AA; 

Query Match 34.2%; Score 863; DB 8; Length 858; 

Best Local Similarity 37.1%; Pred. No. 2.2e-74; 

Matches 181; Conservative 95; Mismatches 190; Indels 22; Gaps 5 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 

I : : : I : M I I I : III I I : : I I I : : : | : I I 

Db 2 02 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 2 61 



Qy 57 GDI YFNPKTRKFTKKAPTSSSQ RS FVEFI LEPLYKI LAQWGDVDTS LPRTLDELGI 113 

I I I I : I I I : I I : : | : | : I I : I : : I : : : : : : : I I 

Db 262 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 321 



Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

| | : : | | | : | : : : : I I : I I I I : I II I : 

Db 322 KLDSEDKDKEGKPLLPCAVMRRWLPAGD7VLLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 381 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I M 1 I I I : : II II : I = II i I I I : I I : : : I I i 1 : I : 

Db 382 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKEE 441 

Qy 234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: : I : : I I : I I I I I : I I I I : I I I I I I II I : I 

Db 4 42 L YLK P I QRT I LMMGRYVE PIE DVP C GN I VGLVGVD Q FLVKT GT ITT FEHAHNMRVMK 498 

Qy 2 94 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

| : : | : : : I I I I I : : I I I : : : I I : : : II 1 : : I I I I I I : I I I I I : I : : 

Db 499 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 558 

Qy 354 HDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II : : : I II : I I I I : : I I I I I : : I : : : I I I I : : I I I 111111 = 

Db 559 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 618 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : I I : I I : I I IE I I I I I I I I I I I t : I 

Db 619 KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 674 



Qy 474 VKDSIVQG 481 

: I I I : I I 
Db 675 IKDSWAG 682 



RESULT 9 
ABB67414 

ID ABB67414 standard; Protein; 699 AA. 
XX 

AC ABB67414; 
XX 

DT 26-MAR-2002 (first entry) 
XX 

DE Drosophila melanogaster polypeptide SEQ ID NO 29034. 
XX 

KW Drosophila; developmental biology; cell signalling; insecticide; 

KW pharmaceutical . 

XX 

OS Drosophila melanogaster. 
XX 

PN WO200171042-A2 . 
XX 

PD 27-SEP-2001. 
XX 

PF 2 3-MAR-2 001; 2 001WO-US09231 . 
XX 

PR 23-MAR-2000; 2000US-191637P . 

PR ll-JUL-2000; 2 000US-0614 150 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL11517. 
XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signalling and cell-cell 

PT interactions - 
XX 

PS Disclosure; SEQ ID NO 29034; 21pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated nucleic acid detection reagent 

CC capable of detecting 1000 or more genes from Drosophila. The invention is 

CC useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences { ABL1617 6-ABL30511 ) , expressed DNA 

CC sequences {ABL0184 0-ABL16175 ) and the encoded proteins 

CC (ABB57737-ABB72072) . 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp . wipo . int/pub/published_pct_sequences . 

XX 

SQ Sequence 699 AA; 

Query Match 33.4%; Score 844.5; DB 22; Length 699; 

Best Local Similarity 37.2%; Pred. No. 9.9e-73; 

Matches 175; Conservative 103; Mismatches 183; Indels 9; Gaps 4 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 



: I | : j | | : Ml I :: : I : : I I : : I I I I : : I I I I = I : : I 

Db 196 VDP S KGS VGFGS GLHGWAFT LKQ FS EMYS EKF- KI DVVKLMNRLWGEN FFNAKTKKW- QK 253 

Qy 72 APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 131 

: : : | I | : I I : I : M : : : : I : : : I : I I = : : I I : I 

Db 254 QKEADNKRSFCMYILDPIYKVFDAIMNYKKEEIGTLLEKIGVTLKHEDKDKDGKALLKTV 313 

Qy 132 CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 191 

: : : I ! : I I I I I : : I II I : I : I I I I I I I I : : 

Db 314 MRTWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAIAVKSCDPDGPLMMYIS 373 

Qy 192 KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 251 

M || : I : I I I I : t : I I : : : I I I I : I I : I : : II 

Db 374 KMVPTSDKGRFYAFGRVFAGKVATGQKCRIMGPNYTPGKKEDLYEKAIQRTILMMGRYVE 433 

Qy 252 EVNRVPAGNWVL I EGVDQ P I VKTAT I TE P RGNEEAQI FRPLKFNTT S VI KI AVEPVN P S E 311 

: | | : i I : I I 1 I : I I I I I I : : I : : | I : : I : : : I I I I I I : : 

Db 434 AI E D VP S GN I C GL VGVDQ F L VKT GT ITT FKDAHNMKVMKFS VS P WRVAVE P KN PAD 490 

Qy 312 LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 371 

M | : : : | | : : : I 1 [ : : I I M I I : I I I I t : I : : | I : : : I : I : I I I 
Db 491 LPKLVEGLKRLAKSDPMVQCIIEESGEHI IAGAGELHLEICLKDLEEDHACI PLKKSDPV 550 

Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGIAEDIENEWQITWNRKKLGEFFQ 431 

| :: | I I I I | : : : I I I I : : I I I : I I I I I : I 1 I : 

D b 551 VSYRETVSEESDQMCLSKSPNKHNRLLMKALPMPDGLPEDIDNGDVSAKDEFKARARYLS 610 

Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

1 I I : I : I I i I I I I I I I I I : : I I ' 1 : I I I : I I 

Db 611 EKYDYDVTEARKIWCFGPDGTGPNFILDCTKSVQ YLNEIKDSWAG 656 



RESULT 10 




ABB58973 




ID 


ABB58973 standard; Protein; 832 AA. 




XX 






AC 


ABB58973; 




XX 






DT 


26-MAR-2002 (first entry) 




XX 






DE 


Drosophila melanogaster polypeptide SEQ 


ID NO 3711. 


XX 




signalling; insecticide; 


KW 


Drosophila; developmental biology; cell 


KW 


pharmaceutical . 




XX 






OS 


Drosophila melanogaster. 




XX 






PN 


WO200171042-A2 . 




XX 






PD 


27-SEP-2001. 




XX 






PF 


23-MAR-2 001; 2 00 1WO-US092 31 . 




XX 






PR 


23-MAR-2 00 0; 2 000US-1 9 1637P . 




PR 


ll-JUL-20 00; 2 00 0US-0 61415 0. 




XX 






PA 


(PEKE ) PE CORP NY. 





XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL03076. 
XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signalling and cell-cell 

PT interactions - 
XX 

PS Disclosure; SEQ ID NO 3711; 21pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated nucleic acid detection reagent 

CC capable of detecting 1000 or more genes from Drosophila. The invention is 

CC useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences (ABL1617 6-ABL30511 ) , expressed DNA 

CC sequences (ABL01840-ABL16175 ) and the encoded proteins 

CC (ABB57737-ABB72072) . 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 832 AA; 

Query Match 33.4%; Score 844.5; DB 22; Length 832; 

Best Local Similarity 37.2%; Pred. No. 1.3e-72; 

Matches 175; Conservative 103; Mismatches 183; Indels 9; Gaps 4 



Qy 


12 


LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 

: I | : | I E : 1 1 1 | : : : I : : I ! : : M 1 1 : : 1 1 f 1 : 1 : : 1 
VDPSKGSVGFGSGLHGWAFTLKQFSEMYSEKF-KIDWKLMNRLWGENFFNAKTKKW-QK 


71 


Db 


196 


253 


Qy 


72 


APTS S SQRS FVEFI LEPLYKI LAQWGDVDT SLPRTLDELGI HLTKEELKLNI RPLLRLV 

: : : | M : 1 i : 1 : ! 1 : : : : 1 : : : 1 : 1 1 : : : 1 1 : 1 
QKEADNKRSFCMYILDPIYKVFDAIMNYKKEEIGTLLEKIGVTLKHEDKDKDGKALLKTV 


131 


Db 


254 


313 


Qy 


132 


CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 

: : : I I : 1 1 1 1 i : : 1 II 1 : i : MINIM:: 
MRTWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAIAVKSCDPDGPLMMYIS 


191 


Db 


314 


373 


Qy 


192 


KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 

II II : I : 1 1 1 1 : 1 : II : : : 1 1 1 1 : II : 1 : : 1 1 

KMVPTSDKGRFYAFGRVFAGKVATGQKCRIMGPNYTPGKKEDLYEKAIQRTILMMGRYVE 


251 


Db 


374 


433 


Qy 


252 


EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPWPSE 
: INN : 1 II 1 : 1 1 1 1 ! 1 : : 1 : MM : 1 : : : 1 1 II N : : 
AI E D VP S GN ICG L VGVD Q F L VKT GT ITT FKDAHNMKVMKF S VS P WRVAVE P KN PAD 


311 


Db 


434 


490 


Qy 


312 


LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 

[| 1 : : : 1 1 : : : N 1 : : II 1 1 N : 1 1 1 1 1 : 1 : : N : : : 1 : 1 Nil 

LPKLVEGLKRLAKSDPMVQCI I EESGEHI IAGAGELHLEICLKDLEEDHACI PLKKSDPV 


371 


Db 


491 


550 


Qy 


372 


VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQ 

1 :: 1 1 1 1 1 1 : : N 1 1 1 : : 1 1 1 : MINN 1 1 : 
VSYRETVSEESDQMCLSKSPNKHNRLLMKALPMPDGLPEDIDNGDVSAKDEFKARARYLS 


431 


Db 


551 


610 



Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I I : I : I I I 1 M I I I I I I : : I I : ' : I I I = I I 

Db 611 EKYDYDVTEARKIWCFGPDGTGPNFILDCTKSVQ YLNEI KDSWAG 656 



Amino acid sequecne of a translation elongation factor-2 (EF-2) . 

Translation elongation factor-2; EF-2; yeast; diphtheria; toxin; 
diphthamide; DPH1; DPH2 ; DPH3; DPH4; DPH5; toxin resistance; EFTl; 
selective killing; microorganism; microcidal toxin; biological control. 



RESULT 11 
AAY43637 

ID AAY43637 standard; Protein; 842 AA 
XX 

AC AAY43637; 
XX 

DT ll-FEB-2000 (first entry) 
XX 
DE 
XX 
KW 
KW 
KW 
XX 

OS Saccharomyces cerevisiae. 
XX 

FH Key Location/Qualifiers 
FT Modified-site 699 

F T /note= "this residue is pos t-translationally 

FT modified to diphthamide by the DPH gene 

FT products" 
XX 

PN W09953762-A1 . 
XX 

PD 28-OCT-1999. 
XX 

PF 16-APR-1999; 99WO-US08213 . 
XX 

PR 17-APR-1998; 98US-0082089 . 
XX 

PA (MICR-) MICROBIA INC. 
XX 

PI Milne GT, Fink GR; 
XX 

DR WPI; 2000-013163/01. 
DR N-PSDB; AAZ30303. 
XX 

PT Selective biological control of microorganisms 
XX 

PS Example 2; Fig 3A-E; 75pp; English. 

CC The present sequence represents the translation elongation factor-2 

CC (EF-2), designated EFTl (for EFT2 see AAY43638). Diphtheria toxin 

CC catalyses the ADP-ribosylation of EF-2 on a diphthamide residue. Mutants 

CC that fail to make diphthamide or lack the histidine precursor of 

CC diphthamide are resistant to the toxin. Therefore, mutations in genes 

CC involved in this (EF-2 (2 genes), DPH1, DPH2, DPH3 , DPH4, and DPH5 ) are 

CC sufficient to provide resistance to diphtheria toxin. Deletion of any of 

CC the DPH genes is sufficient to yield nearly complete resistance to 

CC diphtheria toxin. The DPH toxin phenotype is completely recessive, 

CC while the EF-2 toxin phenotype is generally dominant. This is utilised 



CC in the method of the invention. The specification describes a method 

CC for the selective killing of a microorganism (Ml) . This is achieved by 

CC contacting it with another microorganism (M2 ) producing a microcidal 

CC toxin such as diphtheria. Ml and M2 are allowed to fuse and thus the 

CC toxin is delivered into, and kills the microorganism formed following 

CC fusion. M2 has been modified so that it is resistant to the toxin, 

CC e.g. by introducing mutations in DPH1, DPH3 or DPH4 genes that decreases 

CC biosynthesis of diphthamide. The method is used for selective biological 

CC control of harmful microorganisms. 
XX 

SQ Sequence 842 AA; 

Query Match 32.0%; Score 807.5; DB 21; Length 842; 

Best Local Similarity 35.7%; Pred. No. 5.4e-69; 

Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps i 





14 


PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 

| | | | 1 : It: II M ll=: = 1 1 1 1 1 : 1 1 1 1 1 : 1 : 1 1 
PARGTVAFGSGLHGWAFTIRQFATRYAKKFG— VDKAKMMDRLWGDSFFNPKTKKWTNKDT 


71 


Db 


204 


262 


Ov 


72 


-APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 

| S |:| |||:l:::: :: M |::l 11 =1 1 * M = = 
a nri/DT FDATruMTrTT nPTTTRT FT 7X TNrWFKKDF.T PVLLEKLEI VLKGDEKDLEGKALLKV 


130 


Db 


263 


322 


Qy 


131 


VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 


190 


Db 


323 


| : | | : : I 1 1 : I 1 1 : 1 II 1 1 : : 1 1 1 M : 
VMRKFLPAADALLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIA1 KNCDPKADLMLYV 


382 


Qy 


191 


TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWI SVARYH 

:|| || : | : | 1 M : 1 1 : : 1 1 1 : : 1 1 1 : : 1 1 : 1 : : : 1 : 

S KMVPT S DKGRFYAFGRVFAGTVKS GQKVRI QGPN YVPGKKDDLFI KAI QRWLMMGRFV 


250 


Db 


383 


442 


Qy 


251 


1EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 
: : 1 | 1 | : : | : | | : : 1 1 1 : 1 : 1 1 : : | I : : 1 : : : 1 It 1 : 
EP I DDCPAGNI I GLVGI DQFLLKTGTLTT S ET AHNMKVMKFSVS PVVQ VAVEVKNAN 


310 


Db 


443 


499 


Qy 


311 


ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 

: 1 | | : : : 1 | : : : : M 1 : 1 : Mill:: | 1 | I I : 1 : : II : : : M : : 1 
DLPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 


370 


Db 


500 


559 


Qy 


371 


WTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 

| | : M | M : : : I 1 1 1 : 1 : 1 M : : : :: III:: : 1 
WAYRETVESESSQTALSKSPNKHNRI YLKAEPIDEEVSLAI ENGI 1NPRDDFKARARIM 


430 


Db 


560 


619 


Qy 


431 


QTKYDWDLIiAARSIWAFGPDATGPNILVDDTLPSEVDKAL LGSVKDSI V 47 9 

III: 1 1 II M 1 1 1 1 1 : : : 1 1 1 1 : 1 : II 1 : 1 

ADDYGWDVTDARKIWCFGPDGNGPNLVIDQT KAVQYLHEIKDSW 664 




Db 


620 





RESULT 12 
AAY43638 

ID AAY43638 standard; Protein; 842 AA. 
XX 

AC AAY43638; 
XX 

DT ll-FEB-2000 (first entry) 
XX 



DE Amino acid sequecne of a translation elongation factor-2 (EF-2) . 
XX 

KW Translation elongation factor-2; EF-2; yeast; diphtheria; toxin; 

KW diphthamide; DPH1; DPH2 ; DPH3; DPH4; DPH5; toxin resistance; EFTl; 

KW selective killing; microorganism; microcidal toxin; biological control. 

XX 

OS Saccharomyces cerevisiae. 
XX 

FH Key Location/ Qualifiers 

FT Modif ied-site 699 

FT /note= "this residue is pos t-translationally 

FT modified to diphthamide by the DPH gene 

FT products" 
XX 

PN W09953762-A1. 
XX 

PD 28-OCT-1999. 
XX 

PF 16-APR-1999; 99WO-US08213 . 
XX 

PR 17-APR-1998; 98US-0082089 . 
XX 

PA (MICR-) MICROBIA INC. 
XX 

PI Milne GT, Fink GR; 
XX 

DR WPI; 2000-013163/01. 

DR N-PSDB; AAZ30304. 
XX 

PT Selective biological control of microorganisms 
XX 

PS Example 2; Fig 4A-E; 75pp; English. 
XX 

CC The present sequence represents the translation elongation factor-2 

CC (EF-2), designated EFT2 (for EFTl see AAY43637). Diphtheria toxin 

CC catalyses the ADP-ribosylation of EF-2 on a diphthamide residue. Mutants 

CC that fail to make diphthamide or lack the histidine precursor of 

CC diphthamide are resistant to the toxin. Therefore, mutations in genes 

CC involved in this (EF-2 (2 genes), DPH1, DPH2, DPH3, DPH4, and DPH5) are 

CC sufficient to provide resistance to diphtheria toxin. Deletion of any of 

CC the DPH genes is sufficient to yield nearly complete resistance to 

CC diphtheria toxin. The DPH toxin phenotype is completely recessive, 

CC while the EF-2 toxin phenotype is generally dominant. This is utilised 

CC in the method of the invention. The specification describes a method 

CC for the selective killing of a microorganism (Ml) . This is achieved by 

CC contacting it with another microorganism (M2) producing a microcidal 

CC toxin such as diphtheria. Ml and M2 are allowed to fuse and thus the 

CC toxin is delivered into, and kills the microorganism formed following 

CC fusion. M2 has been modified so that it is resistant to the toxin, 

CC e.g. by introducing mutations in DPH1, DPH3 or DPH4 genes that decreases 

CC biosynthesis of diphthamide. The method is used for selective biological 

CC control of harmful microorganisms . 

XX 

SQ Sequence 842 AA; 



Query Match 32.0%; Score 807.5; DB 21; Length 842; 

Best Local Similarity 35.7%; Pred. No. 5.4e-69; 



Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps 



5; 



Qy 14 PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNPKTRKFTKK — 71 

I I i I I : II: II II II:: : lilt! : I I I I I : I : I I 

Db 204 PARGT VAFGS GLHGWAFT I RQFAT RYAKKFG— VDKAKMMDRLWGD S FFN P KTKKWTNKDT 262 

Qy 72 -APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 130 

I : I : I | | | : | : : : : : : : I I : : I I I : I I : II-: 

Db 263 DAEGKPLERAFNMFILDPIFRLFTAIMNFKKDEIPVLLEKLEIVLKGDEKDLEGKALLKV 322 

Qy 131 VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 190 

j : I I :: I I I : I I I : I 11 I I : : I I I I I : 

Db 323 VMRKFLPAADAXjLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIAI KNCDPKADLMLYV 382 

Qy 191 TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 250 

: M II : I : I I I I : I I : : I I i : : I I I : : I I : I : : : I : 

Db 383 SKMVPTSDKGRFYAFGRVFAGTVKSGQKVRIQGPNYVPGKKDDLFIKAIQRWLMMGRFV 442 

Qy 251 IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 310 

: : I I I I : : I : I I : : I I I : I : I I : : | | : : | : : : I I I I : 

Db 443 EPIDDCPAGNIIGLVGIDQFLLKTGTLTT S ETAHNMKVMKFS VS P WQVAVEVKNAN 499 

Qy 311 ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 370 

: I I I : : : M : : : : I I I : I : Mill:: | || I I : I : : M : : : : I : : I 
Db 500 DLPKLVEGLKRLSKSDPCVXTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 559 

Qy 371 VVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 430 

M : II I I I : : : I M I : I : I I I : : : : : III:: : I 

Db 560 WAYRETVESESSQTALSKSPNKHNRI YLKAEPIDEEVSLAI ENGI INPRDDFKARARIM 619 

Qy 431 QTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP SEVDKAL — LGSVKDSIV 479 

MM I I II I I II I I I : : : I I I I : 1 : I II : I 

Db 620 ADDYGWDVTDARKIWCFGPDGNGPNLVTDQT KAVQYLHEIKDSW 664 



RESULT 13 


ABJ25942 


ID 


ABJ25942 standard; Protein; 884 AA. 


XX 




AC 


ABJ25942; 


XX 




DT 


16-APR-2003 (first entry) 


XX 




DE 


Aspergillus fumigatus essential gene protein #600. 


XX 




KW 


Fungicide; cytostatic; essential gene; Aspergillus fumigatus; infection; 


KW 


cancer; contamination; biofilm; antibody; immune response. 


XX 




OS 


Aspergillus fumigatus . 


XX 




PN 


WO200286090-A2. 


XX 




PD 


31-OCT-2002 . 


XX 




PF 


23-APR-2002; 2002WO-US13142 . 


XX 




PR 


23-APR-2001; 2001US-285697P . 



PR 27-APR-2001; 2001US-287 066P . 

PR 05-JUN-2001; 2001US-295890P . 

PR 09-JUL-2001; 2001US-303899P . 

PR 31-AUG-2001; 2001US-316362P . 
XX 

PA { ELIT— ) ELITRA PHARM INC. 
XX 

PI Jiang B, Tishkoff D, Zamudio C, Eroshkin AM, Hu W, Lemieux SM; 
XX 

DR WPI; 2003-093124/08. 
XX 

PT New purified or isolated nucleic acids of essential genes of 

PT Aspergillus fumigatus, useful for treating or preventing infections by 

PT A. fumigatus, or for treating a non-infectious disease in a subject 

PT e.g. cancer - 

XX 

PS Disclosure; Page -; 175pp; English. 
XX 

CC The invention relates to novel purified or isolated nucleic acids of 

CC essential genes of Aspergillus fumigatus. The isolated nucleic acids of 

CC the invention are used to treat or prevent infections by a pathogenic 

CC organism such as A. fumigatus, to treat a non-infectious disease in a 

CC subject (e.g. cancer), to prevent or contain contamination of an object 

CC by A. fumigatus, or to prevent or inhibit formation on a surface of a 

CC biofilm comprising A. fumigatus. The polynucleotides are useful for 

CC expressing recombinant protein for characterisation, screening or 

CC therapeutic use, as markers for host tissues in which the pathogenic 

CC organisms invade or reside, for comparing with the DNA sequence of A. 

CC fumigatus to identify duplicated genes or paralogues having the same or 

CC similar biochemical activity and/or function, for comparing with DNA 

CC sequences of other related or distant pathogenic organisms to identify 

CC potential orthologous essential or virulence genes, for selecting and 

CC making oligomers for attachment to a nucleic acid array for examination 

CC of expression patterns, for raising anti-protein antibodies, as an 

CC antigen to raise anti-DNA antibodies or to elicit another immune 

CC response, and for identifying polynucleotides encoding the other protein 

CC with which binding occurs or to identify inhibitors of the binding 

CC interaction. The polypeptides may be used to raise antibodies or to 

CC elicit immune response, as a reagent in assays designed to quantitatively 

CC determine levels of the protein in biological fluids, as a marker for 

CC host tissues in which pathogenic organism invade or reside, and to 

CC isolate correlative receptors or ligands in the case or virulence 

CC factors. This sequence represents a protein of one of the essential genes 

CC of Aspergillus fumigatus of the invention. 

XX 

SQ Sequence 884 AA; 



Query Match 31.9%; Score 806; DB 24; Length 884; 

Best Local Similarity 36.0%; Pred. No. 8.3e-69; 

Matches 169; Conservative 104; Mismatches 183; Indels 14; Gaps 5 

Qy 14 PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNPKTRKFTKKAP 73 

I |||j : I I : II II II:::: : I I I I I I I I I : I : I : I I 

Db 201 PDRGTVAFGSGLHGWAFTVRQFAVKYAKKFG— VDRKKMLERLWGDNYFNPQTKKWTKTG- 258 



Qy 74 TSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCK 133 



Db 



259 -EPEQRAFNMFILDPIFKIFAAVNNDKTEEIHKLVEKLEIKLASDEKDLKGKALLKVIMR 317 



Qy 134 KFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTTKM 193 

II : : I I : I I I I : I I I : I : : I I I I I I I : : I I 

Db 318 KFLPAADAMLEMICIHLPSPVTAQKYRAETLYEGPMDDECAIGIRDCDPKAP LMLYVSKM 377 

Qy 194 FSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHIEV 253 

II : I : I I I I I I : : I I : : I I I : : I : : I : : I : : 

Db 378 VPTSDKGRFYAFGRVFSGIVKSGLKVRIQGPNYIPGKKDDLFVKAIQRTILMMGRFVEPI 437 

Qy 254 NRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQI FRPLKFNTTSVIKIAVEPVNPSELP 313 

M I I I I : I I I I : : I : I : I : I I : : | | : : | : : II I : I I 

Db 438 E D VP AGN I VGL VGVDQ F L L K S GT LT T SETAHNLKVMKFSVSPWQRGVEVKNAQDLP 494 

Qy 314 KMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPVVT 373 

I : : : I I : : : : I I I : I : II I : I : : I I M : I : : II : : : : : : : : I I I I : 
Db 495 KLVEGLKRLSKSDPCVLTMINESGQHIVAGAGELHLEICLKDLEEDHAGVPLRISDPWS 554 

Qy 374 FCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQTK 4 33 

: I I I I I : : : : | | | | : : : | : | | : : : II: : : I 

Db 555 YRETVGGESSMTALSKSPNKHNRLYVTAQPLGEEVS LAIESGKINPRDDFKARARLLADD 614 

Qy 434 YDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LG3VKDSIVQG 481 

III: I I I I I I I I I I I : I I I I II: I Mill! 

Db 615 YGWDVTDARKIWCFGPDTTGANLLVDQT KAVQYLNEIKDSFVSG 658 



RESULT 14 
AAB71315 

ID AAB71315 standard; Protein; 845 AA. 
XX 

AC AAB71315; 
XX 

DT 18-NOV-2002 (first entry) 
XX 

DE L. major 4G2-83 extended antigen SEQ ID 110. 
XX 

KW Antigen; immunogenic; antiparasitic; iitimunos timulant ; leishmaniasis; 

KW gene therapy; vaccine; interleukin-12 agonist. 

XX 

OS Leishmania ma j or „ 
XX 

PN US2 002 08132 0-A1 . 
XX 

PD 27- JUN-2002 . 
XX 



PF 


04 


-JUN- 


2001; 2001US- 


0874923 


XX 












PR 


22 


-SEP- 


1995, 


95US- 


0533669 


PR 


12 


-FEB- 


1997, 


97US- 


0798841 


PR 


27 


-AUG- 


1997, 


97US- 


0920609 


PR 


12 


-FEB- 


1998, 


? 98US- 


0022765 


PR 


30 


-OCT- 


1998, 


98US- 


0183861 


PR 


14 


-APR- 


2000, 


; 2000US- 


0551974 


PR 


05 


-MAY- 


2000, 


f 2000US- 


0565501 


PR 


14 


~AUG- 


2000, 


; 2000US- 


0639206 



PA (REED/) REED S G . 

PA (CAMP/) CAMPOS-NETO A. 

PA (WEBB/) WEBB J R. 

PA (DILL/) DILLON D C. 

PA (SKEI/) SKEIKY YAW. 

PA ( BHAT / ) B HAT I A A. 

PA (COLE/) COLER R N. 

PA (PROB/) PROBST P. 

PA (BRAN/) BRANNON M. 

XX 

PI Reed SG, Campos-Neto A, Webb JR, Dillon DC, Skeiky YAW, Bhatia A; 

PI Coler RN, Probst P, Brannon M; 

XX 

DR WPI; 2002-635457/68. 

DR N-PSDB; AAF88573. 
XX 

PT New polypeptide containing at least an immunogenic portion of one or 

PT more Leishmania antigens or their variants, useful for preventing, 

PT treating and detecting leishmaniasis, and stimulating immune responses 

PT in patients - 

XX 

PS Claim 1; Page 108-110; 163pp; English. 
XX 

CC This invention describes a novel polypeptide containing an immunogenic 

CC portion of a Leishmania antigen or its variant which has antiparasitic 

CC and immuno stimulant activity. The compositions and methods of the present 

CC invention are useful for preventing, treating and detecting 

CC leishmaniasis, and stimulating immune responses in patients against 

CC leishmaniasis. The polypeptides and the polynucleotides encoding them can 

CC be used for gene therapy, in vaccines or as interleukin-12 agonists. The 

CC compositions and methods of the present invention, as compared to prior 

CC art, are more improved therapeutic modalities in the diagnosis, 

CC prevention and treatment of leishmaniasis. 

XX 

SQ Sequence 845 AA; 

Query Match 29.4%; Score 743; DB 23; Length 845; 

Best Local Similarity 34.7%; Pred. No. l.le-62; 

Matches 166; Conservative 102; Mismatches 189; Indels 22; Gaps 9 

LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 
: 1 I II I : I : I I I : I I I I : : : : I I I I I : I : I : I : I : 



I : I I : I : I : I | : : : : I I : II I I : II 



iMCVQH I P S P KVGAKP KI EHT YT GGVD S D LGEAMSDCDPDG 18 4 

I I E : I I I I : I I : I : : I : : I I I 



Qy 


12 


Db 


200 


Qy 


72 


Db 


259 


Qy 


129 


Db 


319 


Qy 


185 


Db 


377 



I I : : I I II : I I I I : I I : : I I I : : : I I I 



Qy 



244 ISVARYHIEVNRVPAGNWVLIEGVDQPI VKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 



: : I I I : I I I I : I I I : I M : I M I : I | : I : : : I : : : I 

Db 437 LMMGRYQEAVEDMPCGNWGLVGVDKYIVKSATITD — DGESPHPLRDMKYSVS PWRVA 4 94 

Qy 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY— SE 362 

I | | | | : | | | : : : | | : : : II I : : I I I I M : : I I I I : I : : I I : : : : 

Db 495 VEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQEDFMNG 554 

Qy 363 IDIWADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

: I : : : I I I : I I M : I I : I : : : I I I : : I I : I I : ! : 

Db 555 APLKISEPWSFRETVT DVSSQQCLSKSANKHNRLFCRGAPLTEELALAMEEGTAGPEAD 614 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 479 

I I I : I I : I I I I : I I I I I I : : I I i I : : : M I I 

Db 615 PKVRARFIADNYEWDVQEARKIWCYGPDNRGPNVWD VTKGVQNMAEMKDSFV 667 



RESULT 15 


AAU71861 


ID 


AAU71861 standard; Protein; 845 AA. 


XX 




AC 


AAU71861; 


XX 




DT 


26-FEB-2002 (first entry) 


XX 




DE 


Leishmania antigen 4G2-83 extended protein. 


XX 




KW 


Leishmaniasis; Leishmania antigen; immunos timulant ; protozoacide ; Ldp2 3; 


KW 


interleukin-15; Lbhsp83; M15; Lt-1; LbeIF4A; Lmspla; Lmsp9a; MAPS-lA; 


KW 


LmgSPl; LmgSP3; LmgSPS; LmgSP8; LmgSP9; LmgSP13; LmgSPl9; LcgSPl; LcgSP3; 


KW 


LcgSP4; LcgSP8; LcgSPIO; 1G6-34; 1E6-44; 4A5-63; 1B11-39; 2A10-37; 


KW 


4G2-83; 4H6-41; 8G3-100. 


XX 




OS 


Leishmania major. 


XX 




PN 


WO200179276-A2 . 


XX 




PD 


25-OCT-2001. 


XX 




PF 


05-APR-2001; 2001WO-US11254 . 


XX 




PR 


14-APR-2000; 2 000US-055197 4 . 


PR 


05-MAY-2 000; 2 OOOUS-0565501 . 


PR 


14-AUG-2 000; 2 000US-0 6392 06 . 


XX 




PA 


(CORI-) CORIXA CORP. 


XX 




PI 


Reed SG, Campos-Neto A, Webb JR, Dillon DC, Skeiky YAW, Bhatia A; 


PI 


Coler RM, Probst P; 


XX 




DR 


WPI; 2002-061971/08. 


DR 


N-PSDB; AAS96071. 


XX 




PT 


New isolated Leishmania antigens, useful for prevention, treatment and 


PT 


diagnosis of leishmaniasis, also related nucleic acids for genetic 


PT 


vaccination - 


XX 




PS 


Claim 1; Page 190-192; 193pp; English. 



XX 

CC The invention relates to polypeptides comprising an immunogenic part of a 

CC Leishmania antigen. The Leishmania polypeptides and their associated DNA 

CC sequences, epitopes and fusion proteins are used in the production of 

CC compositions used for inducing a protective immune response against 

CC leishmaniasis , for prevention and treatment of the disease. The 

CC compositions can also be used generally to treat diseases that respond to 

CC interleukin-15 stimulation. In addition, the products may contain an 

CC immuno stimulant. Sequences AAU718 05-AAU718 62 represent Leishmania 

CC antigens and antigenic peptides of the invention. 

XX 

SQ Sequence 845 AA; 

Query Match 29.4%; Score 743; DB 23; Length 845; 

Best Local Similarity 34.7%; Pred. No. l.le-62; 

Matches 166; Conservative 102; Mismatches 189; Indels 22; Gaps 9; 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 

: I | || I : I : I I I : I I I I : : : : I I I I I : I : I : I : I : 

Db 200 VSPEKGTVAIGSGLQAWAFSLTRFANMYAAKFG-VDELKMRERLWGDNFFDAKNKKWIKQ 258 

Qy 72 APTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLL 128 

: : | : | : I I : I : I : I I : : : : I I : I I I I : I I 

Db 259 ETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQVPXKLL 318 

Q y 129 RLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSD LGEAMSDCDPDG 184 

: I i : I I 1 : I I I I : I I : I : : I : : I I I 

Db 319 KTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IKNCDPAA 376 

Qy 185 PLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQICTVGRLW 243 

I I [ : : I I II : I I I I : I I : : I I I : : : I I I : : I = II 

Db 377 PLMLYISKMVPTADRGRFFAFGRI FSGKVRSGQKVRIMGNNYVYGKKQDLYEDKPVQRSV 436 

Qy 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

: : I I | : | | | I : I I I : I I I : I I I I : I | : | : : : I : : : I 

Db 437 LMMG RYQ EAVE DMP C GNWG L VGVD K Y I VK S AT I T D — DGESPHPLRDMKYSVSPWRVA 494 

Qy 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY-SE 362 

|| | | 1 : | | I : : : M : : : M I : : I I I I I I : : I M I : I : : I I : : : : 

Db 4 95 VEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQEDFMNG 554 

Qy 363 IDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

: I : : : I I I : I I I I : M : I : : : I I I = : I I : I I : I : 

Db 555 APLKISEPVVSFRETVTDVSSQQCLSKSANKHNRLFCRGAPLTEELALAMEEGTAGPE7VD 614 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 479 

I | | : I I : 1111:111 I I I : : I I ! I : : : II I I 

Db 615 PKVRARFLADNYEWDVQEARKIWCYGPDNRGPNVWD VTKGVQNMAEMKDSFV 667 



Search completed: January 30, 2004, 11:24:34 
Job time : 73.1206 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 30, 2004, 11:23:12 ; Search time 27.1381 Seconds 

(without alignments) 
749.923 Million cell updates/sec 



Title : US-09-98 9-4 81-3 

Perfect score: 2527 

Sequence: 1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDSIVQG 481 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 328717 seqs, 42310858 residues 

Total number of hits satisfying chosen parameters: 328717 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_M: * 

1 : /cgn2_6/ptodata/l/iaa/5A_COMB.pep : * 

2 : /cgn2__6/ptodata/l/iaa/5B_COMB.pep : * 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep : * 

5 : /cgn2_6/ptodata/ l/iaa/PCTUS_COMB . pep : * 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2527 


100 . 


0 


481 


4 


US- 


08- 


928- 


862-3 


Sequence 


3, Appli 


2 


807.5 


32. 


0 


842 


4 


US- 


09- 


293- 


549-2 


Sequence 


2, Appli 


3 


807.5 


32 


0 


842 


4 


US- 


09- 


293- 


549-4 


Sequence 


4, Appli 


4 


554 . 5 


21 


9 


361 


4 


us- 


09- 


183- 


861-85 


Sequence 


85, Appl 


5 


554. 5 


21 


9 


361 


4 


us- 


09- 


022- 


765-85 


Sequence 


85, Appl 


6 


554 . 5 


21 


9 


361 


4 


us- 


09- 


551- 


974A-8 5 


Sequence 


85, Appl 


7 


179 


7 


1 


900 


4 


us- 


09- 


252- 


991A-25011 


Sequence 


25011, A 


8 


177 


7 


0 


716 


4 


us- 


09- 


252- 


991A-21345 


Sequence 


21345, A 


9 


175 


6 


9 


705 


4 


us- 


09- 


134- 


001C-5356 


Sequence 


5356, Ap 


10 


148.5 


5 


9 


651 


4 


us- 


09- 


198- 


452A-589 


Sequence 


589, App 


11 


113. 5 


4 


5 


277 


4 


us- 


09- 


328- 


352-4749 


Sequence 


4749, Ap 



12 


111 


4 . 


4 


1067 


4 


US- 09-252-9 9 1A-19697 


Sequence 


19697, A 


13 


105 


4. 


2 


1224 


4 


US-09-107-532A-62 2 0 


Sequence 


622 0, Ap 


14 


103 


4. 


1 


1380 


4 


US- 09-32 8-3 52- 8 132 


Sequence 


8132, Ap 


15 


100.5 


4. 


0 


677 


4 


US-09-252-991A-32 924 


Sequence 


32924, A 


16 


97 . 5 


3. 


9 


616 


4 


US-09-32 8-352-534 8 


Sequence 


534 8, Ap 


17 


95 


3. 


8 


738 


6 


5264554-2 


Patent No. 


5264554 


18 


94 


3. 


7 


828 


1 


US-08-261-304-2 


Sequence 


2, Appli 


19 


93 . 5 


3. 


7 


995 


5 


PCT-US95-04 910-14 


Sequence 


14, Appl 


20 


93 


3. 


7 


738 


3 


US-08-478-208-32 


Sequence 


32, Appl 


21 


93 


3. 


7 


738 


4 


US-09-336-536-73 


Sequence 


73, Appl 


22 


92 . 5 


3. 


7 


531 


4 


US-0 9-134-00 1C-3574 


Sequence 


3574, Ap 


23 


92 


3. 


6 


479 


4 


US-0 9-134-001C-412 8 


Sequence 


412 8, Ap 


24 


92 


3. 


6 


732 


4 


US-08-671-757A-7 


Sequence 


7, Appli 


25 


92 


3. 


6 


732 


4 


US-08-671-757A-8 


Sequence 


8, Appli 


26 


91 


3. 


6 


240 


4 


US-09-107-532A-417 5 


Sequence 


4175, Ap 


27 


91 


3 


6 


425 


4 


US-09-134-0 01C-2 8 95 


Sequence 


2895, Ap 


28 


90 . 5 


3 


6 


527 


3 


US-08-369-822C-25 


Sequence 


25, Appl 


29 


90 . 5 


3 


6 


527 


3 


US- 0 8-5 82-7 7 6C- 4 0 


Sequence 


40, Appl 


30 


90 . 5 


3 


6 


527 


3 


US- 08-4 34-8 31B-37 


Sequence 


37, Appl 


31 


90 


3 


6 


499 


5 


PCT-US96-03916-8 


Sequence 


8, Appli 


32 


90 


3 


6 


499 


5 


PCT-US96-03 916-69 


Sequence 


69, Appl 


33 


89.5 


3 


5 


634 


4 


US-09-134-001C-3390 


Sequence 


3390, Ap 


34 


89.5 


3 


5 


840 


2 


US-08-467-822-25 


Sequence 


25, Appl 


35 


89.5 


3 


5 


840 


3 


US-08-432-697-25 


Sequence 


25, Appl 


36 


89.5 


3 


5 


840 


3 


US-08-466-248-25 


Sequence 


25, Appl 


37 


89 . 5 


3 


.5 


1013 


4 


US-08-860-886-2 


Sequence 


2, Appli 


38 


89 . 5 


3 


. 5 


1250 


1 


US-08-441-139-9 


Sequence 


9, Appli 


39 


89 


3 


.5 


1014 


3 


US-09-078-347A-3 


Sequence 


3, Appli 


40 


89 


3 


.5 


1708 


1 


US-08-493-092-2 


Sequence 


2, Appli 


41 


89 


3 


. 5 


1708 


1 


US-08-508-836A-2 


Sequence 


2, Appli 


42 


89 


3 


.5 


1786 


3 


US-08-973-462-8 


Sequence 


8, Appli 


43 


89 


3 


.5 


3056 


1 


US-08-508-836A-8 


Sequence 


8, Appli 


44 


89 


3 


.5 


3056 


2 


US-08-629-001A-3 


Sequence 


3, Appli 


45 


89 


3 


. 5 


3056 


2 


US-08-874-266-2 


Sequence 


2, Appli 



ALIGNMENTS 



RESULT 1 
US-08-928-862-3 

; Sequence 3, Application US/08928862 

; Patent No. 6309877 

; GENERAL INFORMATION: 

; APPLICANT: Chau, Raymond M. W. 

; TITLE OF INVENTION: Isolation and Use of Motoneuronot rophic Factors 

FILE REFERENCE: 12592-2 
; CURRENT APPLICATION NUMBER: US / 0 8 / 92 8 , 8 62 
; CURRENT FILING DATE: 1997-09-12 
; NUMBER OF SEQ ID NOS : 5 

SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 3 
; LENGTH: 481 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-08-928-862-3 



Query Match 100.0%; Score 2527; DB 4; Length 481; 

Best Local Similarity 100.0%; Pred. No. 2.8e~262; 

Matches 481; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


RAYYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIY 


60 




I M I 1 1 1 1 I i 1 1 1 I 1 1 I 1 1 1 I 1 1 1 1 1 1 ! 1 I M 1 1 1 1 1 1 M ! E I 1 1 1 1 1 t 1 1 1 1 1 1 I 1 1 M 








RAYYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIY 


60 


Qy 


61 


FNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEEL 


120 




I i | I I I 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 I 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 II I 1 M 1 1 t 1 




JJO 


O J. 


FWPKTRKFTXKAPT^S SORSFVEFI LEPLYKI LAOWGDVDTSLPRTLDELGIHLTKEEL 


120 


Qy 


121 


KLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDC 


180 




1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




DD 


1 Ol 

1Z1 


fcTT mtrpt T "RT vr , 'K'K'F 1 F 1 f^F FTf;F\/DMf VOHT P S P KVGAKPKI EHTYTGGVDSDLGEAMSDC 


180 


Qy 


181 


DPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVG 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Do 


101 


> 1 i v k - n x J. xVl y lJi O inub vyr n. it i. ur\ v w j. _l ri_riAj^-c vrvv uouim x, ± _1_1j_1i_y.i_1.1_1 _y >j __ x v 


240 


Qy 


241 


RLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVI 


300 




1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 M 1 ! 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 




Db 


241 


RLW I S VAR YH I E VN RVP AGN WVL I E G VDQ P I VKT AT I T E P RGN E EAQ I FRP L K FN T T S VI 


300 


Ov 


301 


KIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY 


360 




I I I I I I I I 1 j 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


KIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY 


360 


Qy 


361 


SEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQIT 


420 




I I I I I I I I 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 




Db 


361 


SEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQIT 


420 


Qy 


421 


WNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNI LVDD 1 IjFb tVDJ\/U__jQ_ V_\JJoX 


4 r n 




1 I M I I I I I I I 1 1 1 II 1 1 1 I 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 i 




Db 


421 


WNRKKLGEFFQTKYDWDLLA7VRS I WAFGPDATGPNI LVDDTLP SEVDKALLGSVKDS IVQ 


480 


Qy 


481 


G 481 
1 




Db 


481 


G 481 




RESULT 2 








US-09-293 


-549 


-2 




; Sequence 2, 


Application US/09293549 




; Patent 


No. 


6440409 





GENERAL INFORMATION: 
APPLICANT: G. Todd Milne 
APPLICANT: Gerald Fink 

TITLE OF INVENTION: A METHOD FOR EFFICIENT AND HIGHLY 
TITLE OF INVENTION: SELECTIVE CONTROL OF MICROORGANISMS 
FILE REFERENCE: 50078/008002 

CURRENT APPLICATION NUMBER: US/09/293,549 
CURRENT FILING DATE: 1999-04-16 
EARLIER APPLICATION NUMBER: 60/082,089 
EARLIER FILING DATE: 1998-04-17 
NUMBER OF SEQ ID NOS : 14 

SOFTWARE: Fas t SEQ for Windows Version 3.0 



; SEQ ID NO 2 

LENGTH: 842 
TYPE : PRT 

; ORGANISM: Saccharomyces cerevisiae 
US-09-293-549-2 



Query Match 32.0%; Score 807.5; DB 4; Length 842; 

Best Local Similarity 35.7%; Pred. No. 3e-77; 

Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps 



Qy 


1 A 


pt T aMVrF^q ^nY^TfFTT.G^FAKT YADTFGDINYOEFAKRLWGDI YFNPKTRKFTKK — 

| Mil : II: II II II:: • M 1 1 1 : 1 1 1 1 I : 1 : 1 1 
PARGTVAFGSGLHGWAFTIRQFATRYAKKFG-VDKAKMMDRLWGDSFFNPKTKKWTNKDT 


71 




9 n a 

<L U ft 


2 62 


Qy 




nPTcjqqnDciWPFTT FPT YKTT AOW^DVDT S LPRTLDELGI HLTKEELKLN I RPLLRL 

1 : | : | | | | : | : : : : : : : I 1 : : 1 1 1 : 1 1 : 1 1 : : 

DAEGKPLERAFNMFILDPIFRLFTAIMNFKKDEIPVLLEKLEIVLKGDEKDLEGKALLKV 


130 


Db 


O "3 
Z DO 


322 


Qy 


± o _L 


vrTf^FFRFFTRFVDMPVOHT PS PKVGAKPKI EHT YTGGVDSDLGEAMSDCDPDGPLMCHT 


190 






1 : 1 1 : : 1 1 1 : 1 1 ! : 1 II 1 1 : : 1 1 1 1 1 : 
VMRKFLPAADALLEMIVLHLP S PVTAQAYRAEQLYEGPADDANCIAI KNCDPKADLMLYV 


382 


Qy 


191 


TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 

: I I || : 1 : 1 1 1 1 : 1 1 : : 1 1 1 : : 1 1 1 : : 1 1 : 1 : : : 1 : 

SKMVPTSDKGRFYAFGRVFAGTVKSGQKVRIQGPNYVPGKKDDLFIKAIQRWLMMGRFV 


250 


Db 


383 


442 


Qy 


251 


IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 
: : | | | | : : I : 1 1 : : 1 I 1 : 1 : 1 1 : : | | : : I : : : 1 1 1 1 : 
EPIDDCPAGNIIGLVGIDQFLLKT GTLTT SETAHNMKVMKFSVS PWQVAVEVKNAN 


310 


Db 


443 


499 


Qy 


311 


ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 

: 1 | | : : : I I : : : : I I 1 : 1 : MM!:: | | | | | : | : : M : : : = 1 : : 1 

DLPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 


370 


Db 


500 


559 


Qy 


371 


VVTFCETWETSSLKCFAETPNKKNKITMIA£PLEKGIxAEDIENEWQITWNRKKLGEFF 

El : Ml M : : : | | | I : I : M 1 : : : : : III :: : 1 

WAYRETVESES SQTALSKS PNKHNRI YLKAEPIDEEVS LAI ENGI INPRDDFKARARIM 


430 


Db 


560 


619 


Qy 


431 


QTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 47 9 
I M : 1 1 II 1 1 M I 1 1 : : : 1 1 1 1 : 1 : i II : 1 




Db 


620 


ADDYGWDVTDARKI WCFGPDGNGPNLVI DQT KAVQYLHEI KDSW 664 





RESULT 3 
US-09-293-549-4 

; Sequence 4, Application US/09293549 

; Patent No. 6440409 

; GENERAL INFORMATION: 

; APPLICANT: G. Todd Milne 

APPLICANT: Gerald Fink 
; TITLE OF INVENTION: A METHOD FOR EFFICIENT 7\ND HIGHLY 
; TITLE OF INVENTION: SELECTIVE CONTROL OF MICROORGANISMS 

FILE REFERENCE: 50078/008002 
; CURRENT APPLICATION NUMBER: US/09/2 93, 549 
; CURRENT FILING DATE: 1999-04-16 

EARLIER APPLICATION NUMBER: 60/082,089 
; EARLIER FILING DATE: 1998-04-17 



NUMBER OF SEQ ID NOS : 14 
; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 4 
; LENGTH: 842 
TYPE: PRT 

; ORGANISM: Saccharomyces cerevisiae 
US-09-293-549-4 



Query Match 32.0%; Score 807.5; DB 4; Length 842; 

Best Local Similarity 35.7%; Pred. No. 3e-77; 

Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps 



Qy 


1 A 


PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNPKTRKFTKK — 

| I I I 1 : It: : : : M 1 1 1 : 1 I 1 M : 1 : 1 1 

PZiDrTVA^r^fiT HOT A FT T RO FAT RYAKKFG- VDKAKMMDRLWGDSFFNPKTKKWTNKDT 


71 


Db 


Z U 4 


262 


Qy 


7 9 


-APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 

| : | : | |||:|:::: : : : 1 1 : : 1 1 1 : 1 1 : 1 1 = = 

DAEGKP LERAFNMFI LD P I FRL FTAIMNFKKDEI PVLLEKLE I VLKGDEKDLEGKALLKV 


130 


JJD 


Z D O 


322 


wy 


131 


VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 

| : ! | : : 1 1 1 : 1 1 1 : 1 II 1 1 : : 1 1 1 1 1 : 
VMRKFLPAADALLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIAI KNCDPKADLMLYV 


190 


Db 


323 


382 


Qy 


191 


TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 

: | 1 II : I : I 1 1 1 : 1 1 : : II 1 : : 1 1 1 : : 1 1 : 1 : : : 1 : 
SKMVPTSDKGRFYAFGRVFAGTVKSGQKVRIQGPNYVPGKKDDLFIKAIQRWLMMGRFV 


250 


Db 


383 


442 


Qy 


251 


IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 

: : | | | | : : 1 : 1 1 : : 1 1 1 : 1 : 1 1 : : I I : : 1 : : : 1 1 1 1 : 
EPIDDCPAGNIIGLVGIDQFLLKTGTLTT SETAHNMKVMKFSVS PWQVAVEVKNAN 


310 


Db 


443 


499 


Qy 


311 


ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 

: | | | : : : | I : : : : I 1 1 : 1 : Mill:: I II II : M : M : : : M : : 1 
DLPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 


370 


Db 


500 


559 


Qy 


371 


VWFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNRKKLGEFF 

| 1 : M 1 II : : : | | | | : I : I I I : : : : : III:: : 1 
WAYRETVESESSQTALSKSPNKHNRI YLKAEPIDEEVS LAI ENGI INPRDDFKARARIM 


430 


Db 


560 


619 


Qy 


431 


QTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP SEVDKAL — LGSVKDSIV 47 9 
III: I 1 1 1 1 1 M II I : : : 1 I II : 1 M 1 1 : 1 




Db 


620 


ADD YGWDVTDARKIWCFGPDGNGPNLVI DQT KAVQYLHEIKDSW 664 





RESULT 4 

US-09-183-861-85 

; Sequence 85, Application US/09183861 

; Patent No. 6365165 

; GENERAL INFORMATION: 

APPLICANT : Reed, Steven G. 
; APPLICANT: Campos-Neto, Antonio 

APPLICANT: Webb, John R. 

APPLICANT: Dillon, Davin C. 

APPLICANT: Skeiky, Yasir A.W. 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USEIN THE THERAPY AND 
; NUMBER OF SEQUENCES: 87 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: SEED and BERRY LLP 

STREET: 6300 Columbia Center, 701 Fifth Avenue 
CITY: Seattle 
STATE: Washington 
COUNTRY: USA 
ZIP: 98104-7092 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 9/ 1 8 3 , 8 61 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/022,7 65 
FILING DATE: 12-FEB-1998 
ATTORNEY/AGENT INFORMATION: 
NAME: Maki, David J. 
REGISTRATION NUMBER: 31,3 92 
REFERENCE/ DOCKET NUMBER: 210121. 420C3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (206) 622-4900 
TELEFAX: (206) 682-6031 
INFORMATION FOR SEQ ID NO: 85: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 361 amino acids 
TYPE: amino acid 
STRANDEDNESS : 
TOPOLOGY: linear 
US-09-183-861-85 

Query Match 21.9%; Score 554.5; DB 4; Length 361; 

Best Local Similarity 34.6%; Pred. No. le-50; 

Matches 122; Conservative 82; Mismatches 136; Indels 13; Gaps 6 

Qy 66 RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

: | : I : : : 1 : I : I I : I : 1 : I I : : : : I I : I I I I = 

D b 1 KKWIKQETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQ 60 

Qy 123 NI RPLLRLVCKKFFGEFTGFVDMCVQHI P S PKVGAKPKI EHT YTGGVDS D LGEAMS 178 

||: I i : I I I : M I I : I I : I : : I : 

D b 61 VPXKLLKTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IK 118 

Q y 179 DCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQIC 237 

: | f | I I I : : I I II : I I I I : I I : : I I I : : : I I I : : ' : 

Db 119 NCDPAAPLMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDK 178 

Q y 2 38 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 2 97 

|| : : I I I : I I t t : I I i : I M : I M I : I I : I : : : 

Db 179 PVQRS VLMMGRYQEAVEDMP CGNVVGLVGVDKYI VKS AT ITD — DGES PH P LRDMKYS VS 23 6 

Qy 298 SVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

| : : : | | | I I I : I 1 I : : : M : : : II I : : I I I I I I : : I I I I = I : : I I : 
Db 237 PWRVAVEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQ 296 



Qy 358 KMY-SEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLA 409 

: : : : | : : : | | | : j I I I : I I : I : : : I I ! : : I I : I I 

Db 2 97 EDFMNGAPLKI S EPWS FRETVTDVS SQQCLS KSANKHNRLFCRGAPLTEXLA 34 9 



RESULT 5 

US-09-022-765-85 

Sequence 85, Application US/09022765 
Patent No. 6375955 
GENERAL INFORMATION: 

APPLICANT: Reed, Steven G. 
APPLICANT: Campos-Neto, Antonio 
APPLICANT: Webb, John R. 
APPLICANT: Dillon, Davin C. 
APPLICANT: Skeiky, Yasir A.W. 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USEIN THE THERAPY AND 
NUMBER OF SEQUENCES: 87 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SEED and BERRY LLP 

STREET: 6300 Columbia Center, 701 Fifth Avenue 
CITY: Seattle 
STATE: Washington 
COUNTRY: USA 
ZIP: 98104-7092 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 022, 765 
FILING DATE: 12-FEB-1998 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION : 
NAME: Maki, David J. 
REGISTRATION NUMBER: 31,392 
REFERENCE/ DOCKET NUMBER: 210121. 420C3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (206) 622-4900 
TELEFAX : (206) 682-6031 
INFORMATION FOR SEQ ID NO: 85: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 3 61 amino acids 
TYPE: amino acid 
STRANDEDNESS : 
TOPOLOGY: linear 
US-09-022-765-85 

Query Match 21.9%; Score 554.5; DB 4; Length 361; 

Best Local Similarity 34.6%; Pred. No. le-50; 

Matches 122; Conservative 82; Mismatches 136; Indels 13; Gaps 6 

Qy 66 RKFTKKAPTSSSQ RSFVEFI LEPLYKI LAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

: [ : | : : : I : I : I I : I : I : [ I : : : : I I : I I I I : 

Db 1 KKWIKQETNADGERVRRAFCQFCLDPIYQI FDAVMNEKKDKVDKMLKSLHVTLTAEEREQ 60 



Qy 123 NIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSD LGEAMS 178 

M : | I : I I i : I If I : I I : I : • I = 

D b 61 VPXKLLKTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG^-IK 118 

Qy 179 DCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQIC 237 

: I I I I | | : : I I II : I I I I : I I : : I I I : : : I I I : : I : 

Db 119 NCDPAAP LMLYI SKMVPTADRGRFFAFGRI FSGKVRSGQKVRIMGNNYVYGKKQDLYEDK 17 8 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 297 

II : : I I I : I I I I : I I i : M I : I I H : I I : I : : : 

Db 179 PVQRSVLMMGRYQEAVEDMPCGNWGLVGVDKYIVKSATITD — DGES PHPLRDMKYSVS 236 

Qy 2 98 SVIKIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

| : : : | | I II I : I I I : : : I I : : : II I : : I I I I I I : : I I I I : I : : I I : 
D b 237 P WRVAVEAKN P S DLP KLVEGLKRLAKS DPLVVC S I EE S GEHI VAGAGELHLEI CLKDLQ 2 96 

Qy 358 KMY- SEI DI KVADPVVT FCETWET S S LKCFAETPNKKNKI TMI AEPLEKGLA 409 

: : : : I : : : I I I : I I I I : I I : I : : : I i I : : I I : I I 

Db 297 EDFMNGAPLKI SEPWSFRETVTDVSSQQCLSKSANKHNRLFCRGAPLTEXLA 349 



RESULT 6 

US-09-551-974A-85 

Sequence 85, Application US/09551974A 
Patent No. 6500437 
GENERAL INFORMATION: 
APPLICANT: Reed, Steven G. 
APPLICANT: Campos -Neto , Antonio 
APPLICANT: Webb, John R. 
APPLICANT: Dillon, Davin C. 
APPLICANT: Skeiky, Yasir A.W. 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USE IN THE 
TITLE OF INVENTION: THERAPY AND DIAGNOSIS OF LEISHMANIASIS 
FILE REFERENCE: 210121. 420C5 

CURRENT APPLICATION NUMBER: US/ 0 9/ 5 5 1 , 97 4A 
CURRENT FILING DATE: 2000-04-14 
NUMBER OF SEQ ID NOS : 101 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 85 
LENGTH: 3 61 
TYPE: PRT 

ORGANISM: Leishmania major 
FEATURE : 

NAME/ KEY: VARIANT 
LOCATION: (1) . . . (361) 

OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-551-974A-85 

Query Match 21.9%; Score 554.5; DB 4; Length 361; 

Best Local Similarity 34.6%; Pred. No. le-50; 

Matches 122; Conservative 82; Mismatches 136; Indels 13; Gaps 6 

Qy 66 RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

: | : | : : : | : I : I I : 1 : I : I I : : : : I I : I I I I : 

Db 1 KKWIKQETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQ 60 



QY 



123 NIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSD 



— LGEAMS 178 



I I : I t : I ! I : I I II : I I : I = : I : 

Db 61 VPXKLLKTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IK 118 

Qy 179 DCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQIC 237 

: | | I ill : : I I II : I I I I : M : : I I I : : : I M : : I : 

D b 119 NCDPAAPLMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDK 17 8 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 297 

|| : : | I I : I I 1 I : I i I : I I I : I I I I : I I : I : : : 

Db 179 P VQ RS VLMMG R YQ EAVE DMP C GNWGLVGVD K Y I VK S AT I T D — DGESPHPLRDMKYSVS 236 

Qy 298 SVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

| : : : I I I I I I : I I I : : : I I : : : II I : : I I I 1 I I : : I I I I : I : = I I : 
Db 237 PWRVAVEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQ 296 

Qy 358 KMY-SEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLA 4 09 

: : : : | : : : I I I : I I II : I I : i : : : I I I : : M : I I 

Db 297 EDFMNGAPLKISEPWSFRETVTDVSSQQCLSKSANKHNRLFCRGAPLTEXLA 349 



RESULT 7 

US- 09-252-99 1A-25 Oil 

; Sequence 25011, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/09/252, 99 1 A 

; CURRENT FILING DATE: 1999-02-18 

; PRIOR APPLICATION NUMBER: US 60/074,788 

PRIOR FILING DATE: 1998-02-18 

PRIOR APPLICATION NUMBER: US 60/094,190 
; PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 25011 
; LENGTH: 9 00 
TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
; FEATURE: 

NAME /KEY: UNSURE 
; LOCATION: (857) 

; OTHER INFORMATION: Identity of amino acid at the above locations are 
unknown . 

US-09-252-991A-25011 

Query Match 7.1%; Score 179; DB 4; Length 900; 

Best Local Similarity 22.2%; Pred. No. le-09; 

Matches 92; Conservative 55; Mismatches 143; Indels 124; Gaps 18 

Qy 109 DEL GIHLTKEELKLNIRPLLRLVCKKFFGEFTG FVDMCVQHIPS 152 

I I I I I : I I : I : I I • : 1 : | : : : I : 

D b 386 DELMNKYLEGEELS I EEI KAGVRQ — RTLANQIVPAVLGSSFKNKGVPLVLDAVIDYLPA 443 



153 PKVGAKPKIEHTYTGGVDSDLGEAMSD — CDPDGPLMCHTTKMFSTHDGVQFHPFGRVLS 210 



Db 


444 


Qy 


211 


Db 


496 


Qy 


267 


Db 


545 


Qy 


316 


Db 


587 


Qy 


375 


Db 


646 


Qy 


435 


Db 


672 



ill! : III I : : I E Mil 



III: I I : III: 



| : | | : : : I I I : I I I I : : t I 

DKPIILERMDFPDP VI SVAVEPKTKADQEKM 586 



II I I I : : : I I I I I : I I 



| | : | | : | | : | : : | : | 

IETIRNT C EI EGKFVRQS GGRGQFGH 671 



III I 1:1 : I I 



RESULT 8 

US- 09-2 52- 9 9 1A-2 1345 

; Sequence 21345, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/ 0 9/ 2 52 , 9 9 1A 

CURRENT FILING DATE: 1999-02-18 

PRIOR APPLICATION NUMBER: US 60/074,788 
; PRIOR FILING DATE: 1998-02-18 

PRIOR APPLICATION NUMBER: US 60/094,190 

PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 21345 
; LENGTH: 716 
; TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US- 09-2 52- 99 1A-2 13 45 

Query Match 7.0%; Score 177; DB 4; Length 716; 

Best Local Similarity 21.6%; Pred. No. l.le-09; 

Matches 99; Conservative 75; Mismatches 172; Indels 112; Gaps 21 

Qy 59 IYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKE 118 

||:| : I : :: :: : ! : : : : I III:: 

Db 201 I YWNDDDKGMTYREEEIPAE LKDLAEEWRSSMVEAAAEANEELMNKYLEEG-ELSEA 256 

Qy 119 ELKLNIRPLLRLVCK KFFGE FTGFV- DMCVQH I P S P KVGAKP KI EHT YTGGV 169 

I : I : I I I I : I : | | : : : | : | I I : II 



Db 



257 EI KEGLR-LRTLACEI VPAVCGS S FKNKGVPLVLDAVI D YLPAPT — EIPAIK GV 308 



Qy 




Db 


309 


Qy 


£. D 




3 67 


Qy 


0 1 ^ 


JJL) 


416 


Qy 


332 


Db 


458 


Qy 


391 


Db 


509 


Qy 


451 


Db 


544 



.DLGEAMSD CDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPV — KVLGE 22 4 

I I : I I : i I : : I I I M I I : : I I I I : 



' VG RLW I S VARYH I E VN RVP AGNWVL IEG VDQPIV 272 

Mi: 1 I : I M : : i = = = M : 



I I M : I 1 M 



I [ M i : : I ! 1 M : M : : : : : : : : : I I : I I : : : 

TfTnF.F.SfiDTTTSGMGELHLDIIVDRMKREFG-VEANIGKPQVAYRETITKDN 50 8 



M : I : 



-PNILVDDTLPSE VDKALLGSVKDSIVQG 4 81 

I : I : I I : I : : I : : I I 



RESULT 9 

US-09-134-0 01C-5356 

; Sequence 5356, Application US/09134001C 
; Patent No. 6380370 
; GENERAL INFORMATION: 

; APPLICANT: Lynn Doucette-Stamm et al 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
STAPHYLOCOCCUS 

; TITLE OF INVENTION: EPIDERMIDIS FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: GTC-007 

CURRENT APPLICATION NUMBER: US/ 0 9/ 134 , 0 0 1C 
; CURRENT FILING DATE: 1998-08-13 
; PRIOR APPLICATION NUMBER: US 60/064,964 

PRIOR FILING DATE: 1997-11-08 

PRIOR APPLICATION NUMBER: US 60/055,779 
; PRIOR FILING DATE: 1997-08-14 

NUMBER OF SEQ ID NOS : 5674 
; SEQ ID NO 5356 
LENGTH: 7 05 
TYPE: PRT 

; ORGANISM: Staphylococcus epidermidis 
US-09-134-0 01C-535 6 

Query Match 6.9%; Score 175; DB 4; Length 705; 

Best Local Similarity 23.9%; Pred. No. 1.8e-09; 

Matches 92; Conservative 70; Mismatches 151; Indels 72; Gaps 17 

Qy 71 KAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 130 

I : : : I : 1 : : : : | | : i MM : : : I : I 

Db 216 KERAEEARAQLIEAVAENNDDLMEKYLGDEEIS VDELKDAI RQATTDVEFYPVL — 269 



Qy 131 VCKKFFGEFTGFVDMCVQ HIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDP 182 

| I I : I I : : | | | : M | ! : : I : : I 

Db 270 -CG TAFKNKGVQLMLNAVI D YLPS P— LDVKP 1 1 GHR ANNPDEEWAKPDD 317 

Qy 183 DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 242 

I : | I I M III: : I 11 I = = I I'll 
Db 318 SAEFAALAFKVM-TDPYVGKLTFFRVYSGTLSSGSYVK NSSKDKRE RVGRL 367 

Qy 243 WISVARYHIEWRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 302 

| I :: I : I I : : I ! I : I : : I ' ' : 

Db 368 LQMHANSRQEIDTVYSGEIAAAVGLKE TGTGDTLCGEKNDI ILESMEF-PEPVIHL 422 

Qy 303 AVEPVNPSELPKMLDGLRKVNKSYPSLTTKV-EESGEHVILGTGELYLDCVMHDLRKMYS 361 

: | | | : : : | | I I : : I : I I : I : : I I I I I : I I : : : : I : : 

Db 423 SVEPKSK7M)QDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFN 482 

Qy 362 EIDIKVADPWTFCETWETSSLK-CFAE TPNK KNKIT 398 

: : I | : | : : | | : : : : I : III: : I I 

Db 4 83 -VECNVGAPMVSYRETFKQ PAQVQGKFSRQSGGRGQYGDVHIEFTPNETGGGFEFENAIV 541 

Qy 399 MIAEP LEKGLAEDIENEW 417 

I : | : | | : : I I I : 

Db 542 GGWP RE YI P SVEQGLKDAMENGVL 566 



RESULT 10 

US-09-198-452A-58 9 

; Sequence 589, Application US/09198452A 

; Patent No. 6559294 

; GENERAL INFORMATION: 

; APPLICANT: Griffais, R. 

; TITLE OF INVENTION: Chlamydia pneumoniae genomic sequence and polypeptides, 
fragments 

TITLE OF INVENTION: thereof and uses thereof, in particular for the 
diagnosis, prevention 

; TITLE OF INVENTION: and treatment of infection 
; FILE REFERENCE: 9710-003-999 

CURRENT APPLICATION NUMBER: US/09/198 , 452A 
; CURRENT FILING DATE: 1998-11-24 
; NUMBER OF SEQ ID NOS : 68 4 9 
; SEQ ID NO 589 

LENGTH: 651 

TYPE: PRT 

ORGANISM: Chlamydia pneumoniae 
US-09-198-452A-58 9 

Query Match 5.9%; Score 148.5; DB 4; Length 651; 

Best Local Similarity 22.4%; Pred. No. l.le-06; 

Matches 86; Conservative 75; Mismatches 164; Indels 59; Gaps 19 

Qy 59 IYFNPKT — RKFTKKAPTSSSQRSFVEF ILEPLYKI LAQWGDVDTSLPR 106 

: | | | I : : I : : I : I I I I : : I : I I : 
Db 142 LYFLDDTLGAKWEEKEISEDLKERCAELRANLLEELATIDESNEAFMMKVLEDPDSI 198 

Qy 107 TLDELGIHLTKEELKLNIRPLLRLVCKKFFGE — FTGFVDMCVQHIPSPKVGAKPKIEHT 164 

III: : I : : I | : I I I : : : | : : I I I : : 



Db 



199 TEDEIHQVMRKGVIENKINPVL CGTAFKNKGVQQLLNVIVKWLPSP— LDRG 248 



Qy 


ICR 




9 A Q 
Z. H. z) 


Qy 


o o o 


Do 


"3 fi c> 


Qy 


283 


Db 


355 


Qy 


342 


Db 


413 


Qy 


397 


Db 


468 



I : | 1 I I I I : I I I I : I I I 



: | I I : : I : I : : : 

-RISRLLEMHANERTDRDEFTVGDIGACVGLKFSVTGDTLCDD 354 



| : | : I : : I I I : I : I I : : I : I = : : 1 : I I : I : : I 

NQEIVLER-IEF-PDPVIDMAIEPKSKGDREKLAQALSSLSEEDPTFRVSTNEETGQTII 

LGTGELYLDCVMHDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNK 

| | | | : ! | :: I ::: ! I I :: I I : : : =11 1 



I I 



RESULT 11 

US-0 9-32 8-352-47 4 9 

Sequence 4749, Application US/09328352 
Patent No. 6562958 
GENERAL INFORMATION: 
APPLICANT: Gary L. Breton et al . 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
ACINETOBACTER 

TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: GTC99-03PA 

CURRENT APPLICATION NUMBER: US/ 09/32 8 , 352 
CURRENT FILING DATE: 1999-06-04 
NUMBER OF SEQ ID NOS : 8252 
SEQ ID NO 4749 
LENGTH: 277 
TYPE: PRT 

ORGANISM: Acinetobacter baumannii 
US- 09-32 8-352-4 7 49 

Query Match 4.5%; Score 113.5; DB 4; Length 277; 

Best Local Similarity 21.3%; Pred. No. 0.0015; 

Matches 35; Conservative 33; Mismatches 57; Indels 39; Gaps 4 

Qy 322 VNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPWTFCETWET 381 

: | : : I I I I I : : t I M I : I I : : : : : = : = : I : I : I I = = I 

Db 4 LKKT FHS GVHTDEE S GQT 1 1 AGMGELHLDI I VDRMKRE FG- VE1ANI GKPMVAYRET I KKT 62 

0v 382 SSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQTKYDWDLLAA 441 

: I : I : I : I 1 : h I I 

Db 6 3 VEQEGKFVRQTGGKGKFGHVYVRLEPLDVEAA 94 

Qy 442 RSIWAFGPDATGPNILVDDTLPSE VDKALLGS VKDS I VQG 481 

: I : I : I I III: : I : : : I 

Db 95 GKEYEFAEEWG GWPKE F F GAVD KG I Q E RMKN GVLAG 132 



RESULT 12 

US-0 9-252-991A-19697 

Sequence 19697, Application US/09252991A 
Patent No. 6551795 
GENERAL INFORMATION: 
APPLICANT: Marc J. Rubenfield et al . 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/ 09/252, 99 1A 
CURRENT FILING DATE: 1999-02-18 
PRIOR APPLICATION NUMBER: US 60/074,788 
PRIOR FILING DATE: 1998-02-18 
PRIOR APPLICATION NUMBER: US 60/094,190 
PRIOR FILING DATE: 1998-07-27 
NUMBER OF SEQ ID NOS : 33142 
SEQ ID NO 19697 
LENGTH: 10 67 
TYPE: PRT 

ORGANISM: Pseudomonas aeruginosa 
US-09-2 52-991A-19697 

Query Match 4.4%; Score 111; DB 4; Length 1067; 

Best Local Similarity 22.5%; Pred. No. 0.028; 

Matches 96; Conservative 65; Mismatches 132; Indels 134; Gaps 28; 

FTKKAPTSSSQRSFVEFILEPLYKILAQ — WGDVDTSLPRTLDELGIHLTKEELKLNIR 125 
||: I : : : | ! : I I : I I I : I I I I : : : : It I : 

FT RLRATA DWLEQIAKNLTKLLAQHPLPGD LGRLIEQV PELAREIK 675 

PLLRLVCKKFFGEFTGFVDMCVQHI P S PKV — GAKPKIEHTYTGGVD SD 172 

: | : : 1 : I I I : I : III! I = = : I 

TQQQFMFTACEE-IGDFRAGEDMEGRERPRHRFVGGWP — EHIREMGIELKKGFSKLTD 732 

LGEAMSD CDPDGPLMCHTTKMFSTHDGVQFHP-FGRVLSGTIHAGQPVKVLGENY 22 6 

I : : I I :| ::| :::| II :|: I I 

LFTRLTDILKEAMDGEG AGGIASHQAEEWYPLFGSLLA RAQGNWELWTAF 7 82 

TLEDEEDSQI CTVGRLWI S VAR YHIEVNRVP AGNWVL I EGVDQ P I VKT 27 4 

j 1 | : I I : | I : : : | I I I I I I : I I = I = 
TCEDPQDSP — PMAR- WLT LAES G S FYD I EANAS P I LAAET LRRNLWNVAYGV LVTS 836 

ATITE PRGNEEAQIFRPLKFNTTSVIKI AVEPVNPS 310 

| I : I It I : I I : : = : I I : 

ATLTALGT FDRYRMRAGLPRNAVTAWP S PFHHAEAGVLRVPDLKADPRNAAEHTAAI I R 8 96 

ELPKMLDGLR KWKSYPSL T T KVEE S GEHVI - LGT GE 34 6 

III::: I I : : : : t : I I : I : N 



: I I I I I : I : : I : I 

-YCEHW IAKIP FAVPDDPVEA 996 



Qy 


68 


Db 


630 


Qy 


126 


Db 


676 


Qy 


173 


Db 


733 


Qy 


227 


Db 


783 


Qy 


275 


Db 


837 


Qy 


311 


Db 


897 


Qy 


347 


Db 


957 



Qy 

Db 



407 GLAEDIE 413 

EM II 
997 ALAEWIE 1003 



RESULT 13 

US-0 9-107-532A-622 0 

; Sequence 6220, Application US/09107532A 
; Patent No. 6583275 

GENERAL INFORMATION: 
; APPLICANT: Lynn A Doucette-S tamm and David Bush 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 

ENTEROCOCCUS FAECIUM FOR DIAGNOSTICS AND 

THERAPEUTICS 

; NUMBER OF SEQUENCES : 7310 

CORRESPONDENCE ADDRESS : 

ADDRESSEE: GENOME THERAPEUTICS CORPORATION 
; STREET: 100 Beaver Street 

CITY: Waltham 
; STATE: Massachusetts 

COUNTRY: USA 
; ZIP: 02354 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: CD /ROM ISO9660 

; COMPUTER: PC 

; OPERATING SYSTEM: <Unknown> 

SOFTWARE: ASCII 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 0 9/ 1 0 7 , 532A 

FILING DATE: 30-Jun-1998 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: 60/085,598 

; FILING DATE: 14 May 1998 

APPLICATION NUMBER: 60/051571 
; FILING DATE: July 2, 1997 

ATTORNEY/AGENT INFORMATION: 
; NAME: Ariniello, Pamela Deneke 

; REGISTRATION NUMBER: 40,489 

; REFERENCE/ DOCKET NUMBER: GTC-012 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: (7 81)8 93-5007 
TELEFAX: (7 81)8 93-8277 
INFORMATION FOR SEQ ID NO: 6220: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 122 4 amino acids 

; TYPE: amino acid 

; TOPOLOGY: linear 

; MOLECULE TYPE: protein 

HYPOTHETICAL: YES 
; ORIGINAL SOURCE: 

; ORGANISM: Enterococcus faecium 

; FEATURE : 

NAME/KEY: misc__f eature 
; LOCATION: (B) LOCATION 1...1224 

; SEQUENCE DESCRIPTION: SEQ ID NO: 622 0: 

US-09-107-532A-622 0 



Query Match 4.2%; Score 105; DB 4; Length 1224; 

Best Local Similarity 19.4%; Pred. No. 0.15; 

Matches 100; Conservative 63; Mismatches 171; Indels 182; Gaps 25; 

Qy 15 LLGNVCFSSSQYSICFTLGSFAKIY ADTFGDINYQEFAKRLWGDI YFNPKT 65 

IN | : i : I I 1 : t : I : I : I I I 

Db 338 LLGKRVHYSGRSVI — WG P FL KMYQ C G L P K EMAI E L FK P FVMRE LVQ RE LAS N I KN AK- 394 

Qy 66 RKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPR — TLDELGIHLTKEEL 120 

||:: : : : : | | : : I I I I I I I : I 

Db 395 RKIERQE DEVWDVLEDVI KEHPVLLNRAPTLHRLGI QAFEPVLVQG 440 

Qy 121 -KLNIRPLLRLVCKKFFGEFTGFVDMCVQHIP SPKVGAKP 159 

: : I | | I : : : I I I I : I : I I I I I 

Db 441 RAIRLHP LVCEAYNADFDG — DQMAVH VP LNE EAQAEARMLMLAAQN I LN P KDG- KP 494 

Q y 160 KIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVL SGTIHA 215 

III! : : I : I : I = : I ' I 

Db 495 w TPSQDMVLGNYYLTMEEEG REGEGMVFRDQNEAVIAWRNGYVHL 54 0 

Qy 216 GQPVKV LGEN-YTLEDEEDSQICTVGRLWISVA R 248 

: I | ! I : I : I : I I 1 I : : : 

Db 541 HSRIGVNPNSLGEKPFTEWQKERTMITTVGKIIFNEIMPPEFPYLNEPTDFNLTVQTPDK 600 

Q y 249 YH I E VN RVP AGNWVL I E GVDQ P I - VKT AT I T E P RGNEEAQI FRPLKFNTTSVIKI 302 

| : | 111:1 = I | | | : : I : I II 
Db 601 YFVEA GTDIPAHIKEQELVLPFKKKNLGNI IAEVFKRFKVTETS 644 

Qy 3 03 AVEPWPSELPKMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 3 62 

||||::: : : : = 

Db 645 KMLDRMKDLGYKHST HAG 662 

Qy 3 63 IDIKVAD-PWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGL7VEDIEN-EWQIT 42 0 

I : :|| I: : ::| i :: I : =1 

Db 663 I T VGI ADI S VLNEKQE 1 1 E NAHKQVETITKQFRRGLITDDERYERVIAV 711 

Qy 421 WNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNI 456 

II I : I I I I I • I I 

D b 712 WNAAK — DSIQQKLMEGLEAKNPIFMMSDSGARGNI 745 



RESULT 14 

US- 0 9-32 8-352-8132 

; Sequence 8132, Application US/09328352 
; Patent No. 6562958 
; GENERAL INFORMATION: 

APPLICANT: Gary L. Breton et al . 
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
ACINETOBACTER 

; TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: GTC99-03PA 

CURRENT APPLICATION NUMBER: US/ 0 9/32 8,352 
; CURRENT FILING DATE: 1999-06-04 
; NUMBER OF SEQ ID NOS : 8252 
; SEQ ID NO 8132 
LENGTH: 13 8 0 
; TYPE: PRT 



ORGANISM: Acinetobacter baumannii 
US- 0 9- 32 8-352-8 132 



Query Match 4.1%; Score 103; DB 4; Length 1380; 

Best Local Similarity 19.5%; Pred. No. 0.31; 

Matches 123; Conservative 99; Mismatches 192; Indels 218; Gaps 32; 

Qy 25 QYSICFT LGSFAKI YADTF GDINYQEFAKRLWGDI 59 

I : : I IE IN Ml I : : I i : I : : : 

Db 351 QFNILFTNDIDRGSFV ADTLRADLTRDREEALVEI YKVMRPGEPPTKEAAENLFNNL 4 07 

Qv 60 YFNPK TRKFTKKAPTSSSQRS-FVEFIL — EPLYKI LAQW G 98 

: | : : | : : : I : I M I I I : : I = I I 

Db 408 FFSSERYDLSPVGRMKFNRRLGRPYEVGTDQKSREVEGILSHEDIIDVLRTLVEIRNGKG 467 

Oy 99 DVDTSLPRTLDELG IHLTKEELK LNI 124 

: I I : I I I HI ; l 

Db 468 EVD DIDHLGNRRVRSVGEMTENQFRVGLVRVERAVKERLSQAETDNLSPQDLINA 522 

Qy 125 RPLLRLVCKKFFG— EFTGFVDMCVQHI PSPKVGAKPKIEHTYTGGV 169 

: | : : I : I I I : : I : I I : I : = I * : I I : 

Db 523 KPVAAAI -KEFFGS SQLSQFMD QNNPLSEITHKRRVSALGPGGLTRERAGFEVRDVH 57 8 

Qv 170 DSDLGEAMSDCDPDGPL — MCHTTKMFSTHDGVQF — HPFGRVLSGTI 213 

: | |:| | ::::::: I |: :|: 1 = 

Db 579 QTHYGRVCPIETPEGPNIGLINSLSVYAKANDFGFLETPYRKWDGRVTDDVEYLSAIEE 638 

Q 214 HAGQPVKVLGENYTLEDEEDSQI CTVGRLWI SVA 24 7 

I I: I:: I I I I : : I ' 

Db 639 VGTVIAQADSAVDKDGNLTEEFVSVRHQGEFVRMPPEKVTHMDVSAQQWSVAASLIPFL 69 8 

Qy 248 RYH I EVNRVPAGNWVLI EGV DQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 302 

| : M | : : : | I : I : I I I I : I : I I 

Db 699 E-HDDANRALMGSNMQRQAVPTLRADKPLVGTGM EANVAR DSGVCVI 744 

Qy 303 A VEPVNPSEL PKMLDG LRKVNKSYPSLTTKVEES GE 338 

| :| |: I : :|: I | | :| : | : :: |: 

Db 745 ANRGGVIEYVDASRIVIRVNEDEMVAGEAGVDIYNLIKYTRS— NQNTCINQNVIVNLGD 8 02 

Qv 339 HVILGTGELYLDCVMHDLRKMYSEIDIKV7VDPWTFCETWETSSLKCFAETPNKKNKIT 398 

| | : : | | : : : : : : I I : i : I I I : I : : : : : I 

Db 803 KV — ARGDI LADGPSTDMGEIiALGQNMRVA — FMTWNGYNYEDS I L LS ERVLQEDRLT 856 

0v 399 MIAEPLEKGLAEDIENEVVQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILV 458 

| : | I : : | | : : I I : I I I I : I : I I I 

Db 857 SIHIQELSCVARDTKLGAEEITADI PNVGEAALSKLD ESGIVYIGAEVTAGDILV 911 

Qy 459 DDTLP SEVDKALLG SVKDS 477 

| : : : I : I I I I I 

Db 912 GKVTPKGETQLTPEEKLLRAIFGEKAADVKDS 943 



RESULT 15 

US-09-2 52-9 91A-32 924 

; Sequence 32924, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 



; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/ 09/252 , 991A 
CURRENT FILING DATE: 1999-02-18 
PRIOR APPLICATION NUMBER: US 60/074,788 
PRIOR FILING DATE: 1998-02-18 
PRIOR APPLICATION NUMBER: US 60/094,190 
PRIOR FILING DATE: 1998-07-27 
NUMBER OF SEQ ID NOS : 33142 
SEQ ID NO 32924 
LENGTH: 67 7 
TYPE: PRT 

ORGANISM: Pseudomonas aeruginosa 
US-0 9-252-991A-32 92 4 

Query Match 4.0%; Score 100.5; DB 4; Length 677; 

Best Local Similarity 18.4%; Pred. No. 0.17; 

Matches 67; Conservative 55; Mismatches 113; Indels 129; Gaps 16; 

Qy 63 PKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPR TLDELG 112 

|: | | |:| | : I I: 11=1 I 1 , 

Db 179 PQTRFVTQKAFKAG LRPIV WNKIDRPGARPDWVIDQIFDLFDNLG 224 

Qv 113 I HLTKEELKLNI RPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYT 166 

I |:| I : I :l = 1 = 1=1 I 
Db 22 5 — ATDEQLDFPIVYASALNGIAGLDHEKMDDNMDALFQAIIDHVPAPW 271 

Oy 167 GGVDSDLGEAMSDCDPDGPLMCHTTKM-FSTHDGVQFHPFGRVLSGTIHAGQPVKVLGEN 225 

| : | | ::::::!! I I = I = = I I = = = 

Db 272 DTEGPFQMQISQLDYNSFLGV — I GI GRI TRGKVKSNT P WAI S DD 315 

Qy 226 YTLEDEEDSQI CTVGRLWI SVARYH 1 E VN RVP AGNWVL I E GVDQ P I VKT AT I T E P R 281 

: ; ||::: : | : | | 11= ! = I = = = = : I = = I = 

Db 316 GSKRN GRI-LKIMGHHGLQRVEVAEAEAGDIVCVSGMEELFI-SDTLCDPQ 364 



Qy 



282 G NEEAQI FRPLKFNTTSVIKI AVEP VNPSEL 312 

I : I M I = I I = I I I : 

Db 365 NVEALP PLTVDQPTVSMTFQVNDS P FAGREGKFVT SRNI KERLEKELLHNVALRVEPGDS 424 

0v 313 PKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPW 372 

| : : : I I I I = I = = = I = ==111 

Db 425 PE KFKVSGRGELHLSVLIETMRR — EGFELAVGRPEV 459 

Qy 373 TFCE 376 

I 

Db 460 VIIE 463 



Search completed: January 30, 2004, 11:27:43 
Job time : 29.1381 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 



January 30, 2004, 11:21:12 ; Search time 26.2023 Seconds 

(without alignments) 

1765.382 Million cell updates/sec 

US-09-989-481-3 
2527 

1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDSIVQG 4 81 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database 



PIR_76: * 
pirl : * 
pir2 : * 
pir3: * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
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translation elonga 
hypothetical prote 
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40 
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6 
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2 


T23556 


41 
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8 . 


9 
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2 


E75536 


42 
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7 
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43 
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2 
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44 


207.5 


8 . 


2 


827 
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elongation factor 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
probable translati 
probable translati 
probable translati 
translation elonga 
probable membrane 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
translation elonga 
hypothetical prote 
translation elonga 
translation elonga 
translation elonga 
hypothetical prote 
translation elonga 



ALIGNMENTS 



RESULT 1 
T29007 

translation elongation factor eEF-2 homolog eft-1 [similarity] - Caenorhabditis 
elegans 

C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 03-Nov-2000 
C; Accession : T2 90 07 
R;Favello, A. 

submitted to the EMBL Data Library, March 1996 

A; Description: The sequence of C. elegans cosmid ZK328. 

A;Reference number: Z20552 

A;Accession: T29007 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-974 <FAV> 

A; Cross-references : EMBL:U5 0193; PIDN : AAA9124 8 . 1; CESP: eft-1 

A; Experimental source: strain Bristol N2 

C; Genetics : 

A; Gene : CESP : eft-1 

A;Introns: 210/1; 335/1; 686/2; 876/3 



C;Superfamily: translation elongation factor 2; translation elongation factor Tu 
homology 

F; 133-2 64 /Domain : translation elongation factor Tu homology <ETU> 

Query Match 69.8%; Score 1764.5; DB 2; Length 974; 

Best Local Similarity 69.1%; Pred. No. l.le-122; 

Matches 331; Conservative 66; Mismatches 81; Indels 1; Gaps 1; 



WY 


4 


YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGD-INYQEFAKRLWGDI YFN 
: : : : : 1 1 M M 1 1 I 1 : I : : | I : | | | : | | | | | f : | | | : | | | | | | | | 
FAEEDVPVLS PLNGNVI FS S GRYNVCFSLLS FSNI YAKQHGDS FNSKEFARRLWGDI YFE 


62 


JJO 


2 95 


354 


wy 


rr q 

DO 


PKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 
1 : 1 : : 1 : 1 1 : 1 1 1 1 1 : 1 1 1 : 1 1 I 1 1 f 1 1 II : | | | | | : | | f | : 

kktrkfvkkspshdaprtfvqfilepmykifsqwgdvdtclpdvmaelgirlskeeq™ 


122 


UJD 


355 


414 






NIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDP 
i : M 1 : 1 : 1 1 : 1 1 | : | : | | | : | | : | I | II III I I I | | | : | | : 

NVRPLIALICKRFFGDFSAFVDLWQNIKSPLENAKTKIEQTYLGPADSQLAQEMQKCNA 


182 




/lie 
4 1 3 


474 


Qy 


183 


DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 
: I 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 : 1 1 1 : 1 1 : 1 1 1 I I | : : : | | | | : | | | | | 
EGPLMVHTTKNYPVDDATQFHVFGRVMSGTLEANTDVRVLGENYSIQDEEDCRRMTVGRL 


242 


Db 


475 


534 


Qy 


243 


WISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 
: : ' I M 1 : I 1 1 1 1 1 1 I I I I : | | | | | | | | | | | f : MINIMI | : | : 

FVRVASYQIEVSRVPAGCWVLIEGIDQPIVKTATIAELGYEEDVYIFRPLKFNTRSCVKL 


302 


Db 


535 


594 


Qy 


303 


AVEPVNPSELPKMLDGLRKVNKSYPSLTTJCVEESGEHVILGTGELYLDCVMHDLRKMYSE 
• 1 1 1 : 1 1 I 1 1 1 1 1 1 1 I I I I | f | | | | | | | : | | | M | | | : | | | | j | : j 1 I 1 1 1 : II : : ! | 

AVEPINPSELPKMLDGLRKVNKSYPLLTTRVEESGEHVLLGTGEFYMDCVMHDMRKVFSE 


362 


Db 


595 


654 


Qy 


363 


IDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGL7VEDIENEWQITWN 

INN I I | : | M : | | | | i | | M 1 1 1 1 1 1 1 : | 1 | | | 1 1 II 1 I 1 I I 

IDIKVADPWTFNETVIETSTLKCFAETPNKKNKITMMAEPLEKQLDED1ENEWQIGWN 


422 


Db 


655 


714 


Qy 


423 


RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

1 : : 1 1 1 M 1 1 1 1 : 1 I 1 i 1 I 1 1 I 1 1 M I 1 | | | | | | : | MM II : | : : | : | | | 

RRRLGEFFQTKYNWDLLAARSIWAFGPDTTGPNILLDDTLPSEVDKHLLSTVRESLVQG 773 


Db 


715 



RESULT 2 
H86197 

hypothetical protein [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text change 31-Mar-2001 
C;Accession: H86197 — 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, o., 
Alonso, J.; Altaf, H.; Araujo, R. ; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; 
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L . ; 
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K. ; Dunn, P.; Etgu, P.; 
Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.; 
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J. ; Johnson-Hopson, C; Khan, S.; Khaykin, E.; 
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin- 
Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, 



S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R. ; Marziali, A.; Militscher , J.; Miranda 
M.; Nguyen, M. ; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P K 
Rrzzo, M.; Rooney, T.; Rowley, D . ; Sakano, H. 

A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H. 
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, CD.; Utterback, T . ; van Aken 
S.; Vaysberg, M. ; Vysotskaia, V. S . ; Walker, M. ; Wu, D.; Yu, G.; Fraser, CM.; 
Venter, J.C.; Davis, R.W. 

A; Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis . 

A;Reference number: A86141; MUID : 2 1 01 6719 ; PMID : 1113 07 12 

A;Accession: H86197 

A; Status: preliminary 

A;Molecule type: DNA 

A; Residues: 1-987 <STO> 

A;Cross-references: GB:AE005172; NID : g8 84 4 127 ; PIDN : AAF8 02 19.1; GSPDB: GN00141 

C; Genetics : 

A; Map position: 1 

C;Superfamily: translation elongation factor 2; translation elongation factor 1 
homology 

Query Match 60.5%; Score 1528.5; DB 2; Length 987; 

Best Local Similarity 61.2%; Pred. No. 3.8e-105; 

Matches 289; Conservative 69; Mismatches 113; Indels 1; Gaps 1; 



Qy 


11 


ILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFG— DINYQEFAKRLWGDI YFNPKTRKFT 
: : 1 1 1 1 1 1 : 1 III MM:!! ! :: : | | f I I | | : | : : | | | 
LID PAAGNVC FAS GTAGWS FT LQ S FAKMYAKLHGVAMDVDKFAS RLWGD VYYH S DT RVFK 


69 


Db 


312 


371 


Qy 


70 


KKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLR 
: 1 = 1 : 1 1 : 1 1 ! 1 1 1 ! ! ! : 1 ! : 1 : I : ! 1 III: I : 1 1 f : I 1 1 I I 
RSPPVGGGERAFVQFILEPLYKI YSQVIGEHKKSVETTLAELGVTLSNSAYKLNVRPLLR 


129 


Db 


372 


431 


Qy 


130 


LVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCH 
'I 'I =11111:11111: 1 1 : : 1 : 1 1 1 1 1 : 1 : I : I | | | | | | : 
LACSSVFGSASGFTDMLVKHIPSPREAAARKVDHS YTGTKDSPI YESMVECDPSGPLMVN 


189 


Db 


432 


491 


Qy 


190 


TTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWI SVARY 
1 ' ; : 1 1 1 1 1 f 1 1 : . 1 1 1 : 1 1 1 1 1 : 1 1 | | | I | : || 1 III 
VT KL YPKS DT S VFDVFGRVYS GRLQTGQ S VRVLGEGYS P EDEEDMT I KEVTKLWI YQARY 


249 


Db 


492 


551 


Qy 


250 


HIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNP 
' 1 : 11:11111111 1 : 1 I I | : : | : IN 1 : I f I |:| | ||:|| 
RIPVS SAPPGSWVLIEGVDASIMKTATLCNAS YDEDVYIFRALQFNTLPVVKTATEPLNP 


309 


Db 


552 


611 


Qy 


310 


SELPKMLDGLRKWKSYPSLTTWEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVAD 
1 1 f 1 1 1 : : 1 1 1 1 : : 1 1 I 1 1 1 M 1 1 I I I 1 1 1 M 1 1 i 1 1 : 1 1 I 1 : : I I | : : : f | | | 
SELPKMVEGLRKISKSYPLAITKVEESGEHTILGTGELYLDSIMKDLRELYSEVEVKVAD 


369 


Db 


612 


671 


Qy 


370 


PVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEF 

1 1 1 : 1 1 1 1 1 1 1 : 1 1 : M 1 1 1 1 1 1 | | M 1 1 1 :: I I M 1 1 1 1 II 1 1 1 1 I : 1 I : I 

PWSFCETWESSSMKCFAETPNKKNKITMIAEPLDRGLAEDIENGWSIDWNRKQLGDF 


429 


Db 


672 


731 


Qy 


430 


FQTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP S EVDKALLGSVKDS I VQG 4 81 

1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E I 1 1 1 1 1 1 : I I I I 1 : 1 1 1 : 1 : : 1 1 1 1 f 1 il 

FRT KYDWD LLAARS I WAFGPDKQGPN I LLDDTLPTEVDRNLMMAVKDS I VQG 7 83 




Db 


732 





RESULT 3 



T39902 

translation Elongation Factor 2 - fission yeast ( Schizosaccharomyces pomhe) 
C; Species: Schizosaccharomyces pombe 

C;Date: 03-Dec-1999 #sequence__revision 03-Dec-1999 #text_change 08-Sep-2000 
C; Accession: T39902 

R;Lyne, M. ; Rajandream, M.A. ; Barrell, B.G.; Rieger, M. 
submitted to the EMBL Data Library, November 1998 
A;Reference number: Z21889 
A;Accession: T39902 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: DNA 
A; Residues: 1-983 <LYN> 

A; Cross-references : EMBL : AL033534 ; PIDN : CAA22 12 6 . 1 ; GSPDB : GN00067 ; 
SPDB: SPBC215 . 12 

A; Experimental source: strain 972h-; cosmid c215 
C; Genetics : 

A; Gene : SPDB : SPBC215 . 12 
A; Map position: 2 
A;Introns: 8/2; 128/2; 245/3 

C; Superf amily: translation elongation factor 2; translation elongation facte 
homology 



Query Match 52.8%; Score 1335; DB 2; Length 983; 

Best Local Similarity 53.7%; Pred. No. 8.7e-91; 

Matches 257; Conservative 74; Mismatches 146; Indels 2; Gaps 1 



Qy 


5 


STDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNPK 
If : 1 1 1 1 1 I 1 1 : 1 1 1 1 1 1 1 I 1 : f f I I : E : 1 ! f 1 1 1 I I ] : ! 
SKDLKYRVSPELGNVCFASCDLGYCFTLSSFAKLYIDRHGGIDVDLFSKRLWGDI YFDSK 


64 


Db 


304 


363 


Qy 


65 


TRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNI 


124 


Db 


364 


Mill- 1 1 1 1 1 1 1 1 I I 1 1 1 : : | | : | , : I : : | : 

TRKFAKQSLDGSGVRSFVHFILEPLYKLHTLTISDEAEKLKKHLSSFQI YLKPKDYLLDP 


423 


Qy 


125 


RPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDC — DP 
= 1 1 1 : 1 : 1 1 II III: : I I I I 1 : 1 1 : | | : : | : | : | : : : 
KPLLQLICASFFGFPVGFVNAVTRHIPSPRENAARKASQSYIGPINSSIGKAILEMSREE 


182 


Db 


424 


483 


Qy 


183 


DGPLMCHTTfCMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 

1 1 : 1 M : : : 1 1 1= Mill: III 1 1 : 1 1 1 1 1 1 : : : 

SAPLVKHWKLYNTVDANNFYAFARVYSGQVKKGQKVKVLGENYSLEDEEDMVVAHIAEI 


242 


Db 


484 


543 


Qy 


243 


WISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 
; 1 1 1 : 1 : II 1 1 : 1 1 1 1 1 1 1 1 I : : | | | | : : I I I : 
CVPCARYRLHVDGAVAGMLVLLGGVDNSISKTATIVSDNLKDDPYIFRPIAHMSESVFKV 


302 


Db 


544 


603 


Qy 


303 


AVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 
1 I 1 1 E 1 I 1 M 1 : M 1 1 f 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I 1 1 1 1 : 1 : 1 I : : : | | | : | | | 
AVEPHNPSELPKLLDGLRKTNKS YPLSITKVEESGEHTIFGTGEMYMDCLLYDLRTLYSE 


362 


Db 


604 


663 


Qy 


363 


IDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGIAEDIENEWQITWN 
1 : 1 : E : 1 ! 1 1 1 1 1 1 : E 1 I : E IE : : 1 I 1 II 1 : 1 1 1 : 1 I I I I I : : | M E 1 1 1 
IEIRVSDPVARFCETAVDTSSIKCFSDTPNKKNRITMWEPLEKGISNDIENGKVNINWP 


422 


Db 


664 


723 


Qy 


423 


RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 4 
: 1 : • 1 M 1 M 1 1 1 1 I : | | | | | | | | I I EM | | | | : : |[ | ; j | | | : j | | 
QKR1SEFFQKNYDWDLLASRSIWAFGPDDRGTNILRDDTLSTDVDKNVLNSVKEYIKQG 7 


81 


Db 


724 


82 



RESULT 4 
EFHU2 

translation elongation factor eEF-2 - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Mar-1992 #sequence_revision 03-Oct-1995 #text change 19-Jan-2001 

C;Accession: S18294; S00467; H33178; S06137 "~ 

R;Rapp, G. ; Klaudiny, J.; Hagendorff, G. ; Luck, M.R.; Scheit, K.H. 

Biol. Chem. Hoppe-Seyler 370, 1071-1075, 1989 

A; Title: Complete sequence of the coding region of human elongation factor 2 
(EF-2) by enzymatic amplification of cDNA from human ovarian granulosa cells 
A; Reference number: S06137; MUID : 90121741 ; PMID:2610926 
A;Accession: S18294 
A; Molecule type: mRNA 
A; Residues: 1-858 <RAP> 

A;Cross-references: EMBL:X51466; NID:g31105; PIDN : CAA3582 9 . 1 ; PID:g31106 
R;Rapp, G. ; Mucha, J.; Einspanier, R. ; Luck, M. ; Scheit, K.H. 
Biol. Chem. Hoppe-Seyler 369, 247-250, 1988 

A; Title: Cloning and sequence analysis of a cDNA from human ovarian granulosa 

cells encoding the C-terminal part of human elongation factor 2. 

A; Reference number: S00467; MUID : 88293714 ; PMID:2840927 

A;Accession: S00467 

A; Molecule type: mRNA 

A; Residues: 501-858 <RA2> 

A; Cross-references: EMBL:M19997; NID:gl81968; PIDN: AAA50388 . 1 ; PID:gl81969 

A; Note: the amino end of the mature protein was determined by protein sequencing 

R;Ward, L.D.; Hong, J.; Whitehead, R.H.; Simpson, R.J. 

Electrophoresis 11, 883-891, 1990 

A; Title: Development of a database of amino acid sequences for human colon 
carcinoma proteins separated by two-dimensional polyacrylamide gel 
electrophoresis . 

A; Reference number: A33178; MUID : 9117 6935 ; PMID:2079031 

A; Accession: H3317 8 

A; Molecule type: protein 

A;Residues: 2-14 , 1 X ', 16, ' X ' , 18-19 <WAR> 

A; Note: the amino end of the mature protein was determined 

C; Comment: Phosphorylation is regulatory and inactivates eEF-2. 

C; Genetics : 

A; Gene: GDB : EEF2 ; EF2 

A;Cross-ref erences : GDB: 119104; OMIM: 130610 
A; Map position: 19pter-19ql2 

C;Superfamily: translation elongation factor 2; translation elongation factor Tu 
homology 

C;Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; 
phosphoprotein; protein biosynthesis 

F; on!f 8/ / r ° dUCt: translation elongation factor eEF-2 #status predicted <MAT> 
F;20 161/Domam: translation elongation factor Tu homology <ETU> 
F;26-33/Region: nucleotide-binding motif A (P-loop) 
F; 158-1 61/Region : GTP-binding NKXD motif 

F;57 59/Binding site: phosphate (Thr) (covalent) (by elongation factor 2 kinase) 
tfstatus predicted 

F;715/Modified site: 2 '-[ 3-carboxamido-3- ( t rimethylammonio ) propyl ] histidine 
(His) #status predicted 



Query Match 34.4%; Score 870; DB 1 ; Length 858; 

Best Local Similarity 37.3%; Pred. No. 2.4e-56; 



Matches 182; Conservative 95; Mismatches 189; Indels 22; Gaps 5 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTF GDI NYQEFAKRLW 56 

I : : : I : I I I I I : ill I I : : I I f : : : | : I I 

Db 2 02 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 2 61 

QY 57 GDIYFNPKTRKFTKKAPTSSSQ RS FVEFILEPLYKI LAQWGDVDTSLPRTLDELGI 113 

I I I I : I I I : I I : : I : I : I I : I : : I : : : : : : : | | 

Db 2 62 GDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDI 321 

Qy H4 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I I * : | | | : f : : : : | | : | j | | : | | | | : 

Db 322 KLDSEDKDKEGKPLLKAVMRRWL PAGDALLQMITIHLPS PVTAQKYRCELLYEGPPDDEA 381 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

■ I I I I I I I : : I I II : I : I I I I j I : I |:::| ||| : | | 
Db 382 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 441 

QY 234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: I : : I I : I I I I I : I I I I : I I I I I I IE I : I 

Db 442 L YLKPI QRT I LMMGRYVEPI EDVPCGNI VGLVGVDQFLVKTGTITT FEHAHNMRVMK 498 

Qy 2 94 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I I I I I : : I I I : : : I I : : : II I : : I I f I I I : I I I I I : I : : 
Db 499 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 558 

Qy 354 HDLRKMYSEIDIWADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

I I : : : I I I : I I I I : : I I I I I : : I : : : I I I I : : | | | 11,11': 
Db 559 KDLEED^CIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 618 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 4 73 

I i : : I I : I I : I I I I I II I M I I I I I I : I 
Db 619 KGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 674 

Qy 474 VKDSIVQG 481 

: I I I : I I 
Db 675 IKDSWAG 682 

RESULT 5 
EFRT2 

translation elongation factor eEF-2 - rat 
C; Species: Rattus norvegicus (Norway rat) 



C;Date: 28-Feb-1990 #sequence_revision 02-Dec-1994 #text_change 19-Jan-2001 
C;Accession: S04007; B25440; A11180; S02526; S62217 
R;Oleinikov, A.V. ; Jokhadze, G.G.; Alakhov, Y.B. 
FEBS Lett. 248, 131-136, 1989 

A;Title: Primary structure of rat liver elongation factor 2 deduced from the 
cDNA sequence . 

A/Reference number: S04007; MUID : 8 92 52 028 ; PMID:2721670 
A;Accession: S04007 
A; Molecule type: mRNA 
A; Residues: 1-693 <OLE> 

A; Cross-references : EMBL : Y07504 ; NID : g5608 1 

A;Note: the sequence in GenBank entry RNEF2R (PID:g56082) includes the sequence 
of B25440 not determined by these authors 



R;Kohno, K. ; Uchida, T . ; Ohkubo, H.; Nakanishi, S.; Nakanishi, T . ; Fukui T • 
Ohtsuka, E . ; Ikehara, M. ; Okada, Y. 

Proc. Natl. Acad. Sci. U.S.A. 83, 4978-4982, 1986 

A; Title: Amino acid sequence of mammalian elongation factor 2 deduced from the 

cDNA sequence: homology with GTP-binding proteins. 

A; Reference number: A25440; MUID : 8 62597 1 6 ; PMID: 3014523 

A;Accession: B25440 

A;Molecule type: mRNA 

A; Residues: 516-858 <KOH> 

A;Cross-references: GB:K03502; NID:g203996; PIDN : AAA4 110 6 . 1 ; PID:g203997 
A;Note: nucleotide differences with hamster shown 
R; Robinson, E.A.; Henriksen, O.; Maxwell, E.S. 
J. Biol. Chem. 249, 5088-5093, 1974 

A;Title: Elongation factor 2. Amino acid sequence at the site of adenosine 
diphosphate ribosylation . 

A;Reference number: A11180; MUID : 743 012 60 ; PMID:4368673 

A; Accession : All 18 0 

A;Molecule type: protein 

A;Residues: 702-714 ,' X ' , 716 <ROB> 

A; Experimental source: liver 

R;Nilsson, L.; Nygard, O. 

Eur. J. Biochem. 171, 293-299, 1988 

A; Title: Structural and functional studies of the interaction of the eukaryotic 

elongation factor EF-2 with GTP and ribosomes. 

A; Reference number: S02526; MUID : 88111683 ; PMID:3338467 

A;Accession: S02526 

A;Molecule type: protein 

A;Residues: 2-14 ; 68-79; 572-585 <NIL> 

R;Guillot, D.; Vard, C; Reboud, J. P. 

Eur. J. Biochem. 236, 149-154, 1996 

A; Title: Photoaf f ini ty labeling of elongation factor-2 with 8-azido derivatives 
of GTP and ATP. 

A; Reference number: S62217; MUID : 961 84 8 92 ; PMID: 8617259 

A; Access ion: S 622 17 

A; Status: preliminary 

A;Molecule type: protein 

A;Residues: 2-7 ; 24 0-244 ; 265-272 <GUI> 

C; Comment: Phosphorylation is regulatory and inactivates eEF-2 

C; Super family: translation elongation factor 2; translation elongation factor Tu 
homology 

C; Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; 
phosphoprotein; protein biosynthesis 

F;2-858/Product: translation elongation factor eEF-2 #status predicted <MAT> 
F;20-161/Domam: translation elongation factor Tu homology <ETU> 
F;26-33/Region: nucleotide-binding motif A (P-loop) 
F; 15 8-1 61 /Region : GTP-binding NKXD motif 

F,*57 59/Binding site: phosphate (Thr) (covalent) (by elongation factor 2 kinase) 
ffstatus predicted 

F;715/Modified site: 2 ■- [ 3-carboxamido-3- ( trimethylammonio ) propyl ] histidine 
(His) #status experimental 

Query Match 34.3%; Score 868; DB 1; Length 858; 

Best Local Similarity 37.7%; Pred. No. 3.3e-56; 

Matches 184; Conservative 92; Mismatches 190; Indels 22; Gaps 5; 

Qy 9 NLI LSPLLGNVCFSSSQYSICFTLGSFAKIYADTF GDINYQEFA KRLW 5 6 

I : : : I : I i I I I : Ml | | : : | | | . , , , . 



Db 


202 


NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGAAERAKKVEDMMKKLW 


261 


Qy 


57 


GDIYFNPKTRKFTKKAPTSSSQ RS FVEFILEPLYKILAQWGDVDTSLPRTLDELGI 


113 


Db 


262 


1 1 I f : 1 1 1 : 1 1 : : 1 : ! : I I : f : : | : : : : : : : | | 

GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 


321 


Qy 


114 


HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 

1 1 : : | | 1 : [ : : : : | 1 : I I | [ : | | | | : 
KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 


173 


Db 


322 


381 


Qy 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 
= 1 1 1 1 1 1 1 : : 1 1 II : I : 1 1 1 1 1 1 : | I : : : | | | | : | | 
AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 


233 


Db 


382 


441 


Qy 


234 


SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 


293 


Db 


442 


: : i : = 1 1 : 1 1 1 1 1 : 1 I 1 1 : 1 I I I I I | | | : | 
L YL K P I Q RT I LMMG R YVE PIE D VP C GN I VGL VGVD Q FL VKT GT ITT FEHAHNMRVMK 


498 


Qy 


294 


FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 

1 : : 1 : : : : : 1 1 1 :: : 1 1 : : : i I I : : 1 ! I 1 M : 1 j I I I : I : : 

FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


499 


558 


Qy 


354 


HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 

II : : : 1 II : 1 1 1 1 * • Ml 1 I • • t • ■ • I I I I • • I i i t i i i i i . 
11 ■ * 1 11 • i i t t • - i i i i i . . i ... | | | | . . | [ | 1 1 1 1 1 1 : 

KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 


413 


Db 


559 


618 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 

1 1 : 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 I I i I | | : I 
KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 


473 


Db 


619 


674 


Qy 


474 


VKDSIVQG 4 81 




Db 


675 


: 1 1 1 : 1 1 

IKDSWAG 682 





RESULT 6 
A25440 

translation elongation factor eEF-2 - Chinese hamster 
C; Species: Cricetulus griseus (Chinese hamster) 

C;Date: 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change 02-Feb-2001 
C;Accession: A25440; A28660 

R;Kohno, K.; Uchida, T.; Ohkubo, H.; Nakanishi, S.; Nakanishi, T.; Fukui, T.; 
Ohtsuka, E.; Ikehara, M. ; Okada, Y. 

Proc. Natl. Acad. Sci. U.S.A. 83, 4978-4982, 1986 

A; Title: Amino acid sequence of mammalian elongation factor 2 deduced from the 

cDNA sequence: homology with GTP-binding proteins. 

A;Reference number: A25440; MUID : 8 62597 16 ; PMID:3014523 

A;Accession: A25440 

A; Molecule type: mRNA 

A; Residues: 1-858 <KOH> 

A;Cross-references: GB:M13708; NID:gl91065; PIDN : AAA5 03 87 . 1 ; PID:g304505 
R;Nakanishi, T.; Kohno, K . ; Ishiura, M. ; Ohashi, H.; Uchida, T. 
J. Biol. Chem. 263, 6384-6391, 1988 

A; Title: Complete nucleotide sequence and characterization of the 5' -flanking 

region of mammalian elongation factor 2 gene. 

A; Reference number: A28660; MUID : 88 198187 ; PMID:2834376 

A/Accession: A28660 



A;Molecule type: DNA 

A; Residues: 1-15, 'E', 17-3 02, 'P', 304-44 0, 1 D 1 , 442-716, ' R 1 , 718-830, 'G', 832-8 5 8 
<NAK> 

A;Cross-references : GB:J03200; NID:gl91009; PIDN : AAA503 8 6 . 1 ; PID:g387049 
A;Note: the authors translated the codon GAA for residue 16 as Lys, CCC for 
residue 303 as Ala, and GGC for residue 831 as Pro 

C; Super family : translation elongation factor 2; translation elongation factor Tu 
homology 

C;. Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; 
phosphoprotein; protein biosynthesis 

F; 2 0-1 61/Domain : translation elongation factor Tu homology <ETU> 
F;26-33/Region: nucleotide-binding motif A (P-loop) 
F; 15 8- 161/Region : GTP-binding NKXD motif 

F; 715/Modif ied site: 2 1 - [ 3-carboxamido-3- ( t rime thylammonio ) propyl ] histidine 
(His) #status predicted 

Query Match 34.2%; Score 863; DB 2; Length 858; 

Best Local Similarity 37.1%; Pred. No. 7.8e-56; 



Matches 


181; Conservative 95; Mismatches 190; Indels 22; Gaps 


5 


Qy 


9 


NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF — - GDI NYQEFAKRLW 


56 


Db 


202 


1 : : : 1 : 1 1 1 1 1 : Ml 1 i : : 1 I I : : : | : |[ 
NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMWAKFAAKGEGQLGPAERAKKVEDMMKKLW 


261 


Qy 


57 


GDI YFNPKT RKFTKKAPTS SSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 


113 


Db 


262 


M 1 1 : 1 i 1 : 1 1 : : 1 : 1 : 1 1 : I : : | : : : : :;:| | 
GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 


321 


Qy 


114 


HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 

1 f : : 1 1 1 : I : : : : | | : | | | j - | | | | : 
KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 


173 


Db 


322 


381 


Qy 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 
= Ml 1 1 1 I : : [ 1 II : 1 : 1 II 1 1 1 : 1 1 : : : I I I I : | : 
AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKEE 


233 


Db 


382 


441 


Qy 


234 


S Q I CT VGRLW I S VAR YH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E EAQ I FR P L K 
= : 1 : : 1 1 : 1 1 M 1 : 1 1 1 1 : 1 1 1 1 1 1 II | : | 
L YL K P I Q RT I LMMGR YVE PIE D VP C GN I VGL VGVDQ FL VKT GT ITT FEHAHNMRVMK 


293 


Db 


442 


498 


Qy 


294 


FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 
1 : : 1 : : : 1 1 1 M : : M 1 : : : 1 1 : : : M 1 : : 1 ! 1 1 | | : | | | | | : | : : 
FSVSP VVRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


499 


558 


Qy 


354 


HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 
1 1 s : : 1 1 1 : 1 II 1 : : 1 1 1 1 1 : : 1 : : : 1 1 1 1 : : I I I 1 1 1 1 1 1 : 
KDLEEDHACIPIKKSDPVVS YRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 


413 


Db 


559 


618 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 

1 1 • 1 1 : 1 1 : 1 1 1 1 1 1 If 1 1 1 1 1 1 | | : I 
KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 


473 


Db 


619 


674 


Qy 


474 


VKDSIVQG 481 
: 1 1 I : 1 1 

IKDSWAG 682 




Db 


675 





RESULT 7 
S05988 

translation elongation factor eEF-2 - fruit fly (Drosophila melanogas ter ) 
C; Species: Drosophila melanogaster 

C;Date: 01-Dec-1989 #sequence_revision 01-Dec~1989 #text_change 02-Feb-2001 
C; Accession: S 0598 8 

R;Grinblat, Y . ; Brown, N.H.; Kafatos, F.C. 
Nucleic Acids Res. 17, 7303-7314, 1989 

A;Title: Isolation and characterization of the Drosophila translational 
elongation factor 2 gene. 

A;Reference number: S05988; MUID : 900 167 92 ; PMID:2508059 
A; Access ion: S 0598 8 
A; Molecule type: DNA 
A; Residues: 1-844 <GRI> 

A;Cross~references : EMBL:X15805; NID:g7918; PIDN : CAA338 04 . 1 ; PID:g7919 

C; Genetics : 

A; Gene : FlyBase: Ef 2b 

A; Cross-references : FlyBase : FBgn0000559 
A; Map position: 2L 39E-F 

C; Superf amily : translation elongation factor 2; translation elongation factor Tu 
homology 

C;Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; 
phosphoprotein; protein biosynthesis 

F; 2 0-1 65 /Domain : translation elongation factor Tu homology <ETU> 
F; 2 6-3 3 /Region : nucleotide-binding motif A (P-loop) 
F; 162- 165 /Region : GTP-binding NKXD motif 
F; 60 9- 611 /Region : GTP-binding SAK/L motif 

F; 701/Modified site : 2 1 - [ 3-carboxamido-3- ( trimethylammonio ) propyl ] his tidine 



(His) 


#s tatus 


predicted 




Query Match 33.5%; Score 845.5; DB 2; Length 844; 
Best Local Similarity 37.2%; Pred. No. 1.5e-54; 

Matches 175; Conservative 103; Mismatches 183; Indels 9; Gaps 


4 


Qy 


12 


LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 

: I 1 : 1 1 1 : lit | : : : | : : I 1 : : I 1 M : : 1 1 f 1 : 1 : : 1 
VDPSKGSVGFGSGLHGWAFTLKQFSEMYSEKF-KIDWKLMNRLWGENFFNAKTKKW-QK 


71 


Db 


208 


265 


Qy 


72 


APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 
: : : I I I : j | : | : | | : : : : I : : : I : I I : : : | 1 : 1 
QKEADNKRSFCMYILDPI YKVFDAIMNYKKEEIGTLLEKIGVTLKHEDKDKDGKALLKTV 


131 


Db 


266 


325 


Qy 


132 


CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 

: : : 1 1 : 1 1 1 1 1 : : 1 II 1 : 1 : 1 1 1 1 1 i 1 1 : : 
MRTWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAIAVKSCDPDGPLMMYIS 


191 


Db 


326 


385 


Qy 


192 


KMF S T HDGVQ FH P FGRVL S GT I HAGQ P VKVL GEN YT LEDEEDSQI CT VGRLW I S VARYH I 
1 1 1 i : | : | | | | : | : M : : : 1 1 1 1 : 1 I : t : : 1 1 
KMVPT S DKGRFYAFGRVFAGKVAT GQKCRIMGPN YT PGKKEDL YEKAI QRT I LMMGRYVE 


251 


Db 


386 


445 


Qy 


252 


EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 

: 1 ! : 1 1 : 1 1 1 1 : 1 1 1 1 1 1 : : 1 : : II : : 1 : : : 1 1 1 1 1 1 : : 

AI E D VP S GN I C GL VG VD Q FL VKT GT ITT FKDAHNMKVMKF S VS P WRVAVE P KN PAD 


311 


Db 


446 


502 


Qy 


312 


LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 

1 1 1 : : : II : : : 1 1 1 : : 1 M 1 1 1 : 1 1 1 1 1 : I : : | 1 : : : 1 : 1 : 1 1 1 
LPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHACI PLKKSDPV 


371 


Db 


503 


562 



Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQ 431 
f * : : : : | | | | : : | | | : : | | | 

Db 563 VSYRETVSEESDQMCLSKSPNKHNRLLMKALPMPDGLPEDIDNGEVSAKDEFKARARYLS 622 

432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 
I M : I : I I I I I I I I I I f I :: t I : | : | | | : f | 

Db 623 EKYDYDVTEARKIWCFGPDGTGPNFILDCTKSVQ YLNEIKDSWAG 668 



RESULT 8 
T21362 

hypothetical protein F25H5.4 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text change 02-Sep-2000 
C;Accession: T21362 ~ 
R;Steward f C. 

submitted to the EMBL Data Library, October 1996 
A; Reference number: Z19412 
A; Accession: T213 62 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-852 <WIL> 

A;Cross-references: EMBL : Z 8 10 68 ; PIDN : CAB02985 . 1 ; GSPDB : GN0001 9 ; CESP:F25H5.4 

A; Experimental source: clone F25H5 

C; Genetics : 

A; Gene : CESP : F25H5 . 4 

A; Map position: 1 

A;Introns: 1/3; 73/2; 191/3; 250/3; 752/1 

C;Superfamily: translation elongation factor 2; translation elongation factor T\ 
homology 

F;20-173/Domain: translation elongation factor Tu homology <ETU> 

Query Match 33.1%; Score 837.5; DB 2; Length 852; 

Best Local Similarity 36.2%; Pred. No. 6e-54; 

Matches 175; Conservative 103; Mismatches 193; Indels 13; Gaps 5; 



Qy 


2 


AYYSTDEN LILSPLLGNVCFSSSQYSICFTLGSFAK1 YADTFGDINYQEFAKRLWG 

If 1 : : : : | : | | | | | : | | | || . . | , , , . . , f| , 

ATYGDDDGPMGPIMVDPSIGNVGFGSGLHGWAFTLKQFAEMYAGKFG-VQVDKLMKNLWG 


57 


Db 


202 


260 


Qy 


58 


DIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTK 


117 


Db 


261 


i : 1 : M : 1 : : 1 1 : 1 f : I : | : f : : : | : : : : 1 1 1 1 

DRFFDLKTKKWS-STQTDESKRGFCQFVLDPIFMVFDAVMNIKKDKTAALVEKLGIKLAN 


319 


Qy 


118 


EELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAM 
: 1 1 Ml::: : | : s | f - ( | , ( . . , , , , . , . 

DEKDLEGKPLMKVFMRKWLPAGDTMLQMIAFHLPSPVTAQKYRMEMLYEGPHDDEAAVAI 


177 


Db 


320 


379 


Qy 


178 


SDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQIC 
1 N : 1 1 1 I : : I I If : 1 : MM II : ! :: | [| : | | 

KTCDPNGPLMMYISKMVPTSDKGRFYAFGRVFSGKVATGMKAR1QGPNYVPGKKEDLYEK 


237 


Db 


380 


439 


Qy 


238 


TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 
1 : 1 = = 1 : : M : M : M 1 1 : II II I : : I I : M : : 
T 1 QRT I LMMGRFi EPIEDIPS GNI AGLVGVDQYLVKGGT ITT YKDAHNMRVMKFS VS 


297 


Db 


440 


496 



Oy 298 SVIKIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

| : : : j | | || : : I i I : : : I I : : : I I I : I : I I I I I : I : : I I 

Db 497 PVVRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIFEESGEHIIAGAGELHLEICLKDLE 556 

Qy 358 KMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEW 417 

: : : | : | : I I I I : : I I I I : I : : : I I I I : : I : I : I I I : M I I 

Db 557 EDHACIPLKKSDPWSYRETVQSESNQICLSKSPNKHNRLHCTAQPMPDGLADDIEGGTV 616 

Qy 418 QITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDS 477 

j : I I : : 1 : I I I I I I I I I I I h h I I : ' : I I I 

Db 617 NARDEFKARAKILAEKYEYDVTEARKIWCFGPDGTGPNLLMDVTKGVQ YLNEIKDS 672 

Qy 478 IVQG 481 

: I I 

Db 673 WAG 676 



RESULT 9 
A40411 

translation elongation factor eEF-2 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change 02-Feb-2001 

C; Access ion: A4 0411 

R;Ofulue, E.N.; Candido, E.P.M. 

DNA Cell Biol. 10, 603-611, 1991 

A; Title: Molecular cloning and characterization of the Caenorhabditis elegans 
elongation factor 2 gene (eft-2) . 

A; Reference number: A40411; MUID : 92 02 9622 ; PMID: 1930695 
A; Accession: A4 0411 
A;Molecule type: mRNA 
A; Residues: 1-852 <OFU> 

A; Cross-references: GB:M86959; NID:gl56278; PIDN : AAD0333 9 . 1 ; PID:gl56279 
C;Superfamily: translation elongation factor 2; translation elongation factor Tu 
homology 

C; Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; 
phosphoprotein; protein biosynthesis 

F;20-173/Domain: translation elongation factor Tu homology <ETU> 
F;26-33/Region: nucleotide-binding motif A (P-loop) 
F; 17 0-17 3 /Region : GTP-binding NKXD motif 

jr. 7Q9 /Modified site: 2 [ 3-carboxamido-3- ( trimethylammonio ) propyl] histidine 
(His) #status predicted 

Query Match 32.7%; Score 827.5; DB 2; Length 852; 

Best Local Similarity 36.0%; Pred. No. 3.3e-53; 

Matches 174; Conservative 104; Mismatches 193; Indels 13; Gaps 5; 



Qy 


2 


AYYSTDEN LILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWG 

|| | : : : : I : 1 1 1 ! 1 : Ml 1 1 : : M 1 1 : : INI 

ATYGDDDGPMGPIMVDPSIGNVGFGSGLHGWAFTLKQFAEMYAGKFG-VQVDKLMKNLWG 


57 


Db 


202 


260 


Qy 


58 


DIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTK 

| : | : M : | : : 1 | : | | : I : I : 1 : : : 1 : : : : 1 1 1 1 

DRFFDLKTKKWS-STQTDESKRGFCQFVLDPIFMVFDAVMNIKKDKTAALVEKLGIKLAN 


117 


Db 


261 


319 


Qy 


118 


EELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAM 


177 


Db 


320 


: | | : | | : : : : | : : 1 hill 1 : : 1 11 1 : 1 : 
DEKDLEGKPLMKVFMRKWLPAGDTMLQMIAFHLPSPVTAQKYRMEMLYEGPHDDEAAVAI 


379 



QY 178 SDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQIC 237 

I I I : I I I I :: I I II : I : I I I I I I : I : : I M : | | 

Db 380 KTCDPNGPLMMYISKMVPTSDKGRFYAFGRVFSGKVATGMKARIQGPNYVPGKKEDLYEK 439 

Qy 238 TVGRLWISVARYHIEVNRVPAGMWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 2 97 

I : I : : I : : : I : ! I : I I I I : I i III : : I I : I | : : 

Db 440 T I QRT I LMMGRFI EP I ED I P S GNI AGLVG VDQYLVKGGT ITT YKDAHNMRVMKFSVS 496 

Qy 298 SVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

I : : : I I I I I : : I I I : : : I I : : : II I : II I I I I : I I I M : I : : II 

Db 497 PWRVAVEAKNP^JDLPKLVEGLKRLAKSDPMVQCIFEESGEHIIAGAGELHLEICLKDLE 556 

Qy 358 KMYSEIDIKVADPAA/TFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVV 417 

: : : I : I : I I I I : : I I I I : I : : : I I I I : : I : I : I I I : I I j i 
Db 557 EDHACIPLKKSDPWSYRETVQSESNQICLSKSPNKHNRLHCTAQPMPDGLADDIEGGTV 616 

Qy 418 QITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDS 477 

I II::: I I I I I I I I I I I I : I : I | : I : | | | 

Db 617 SARDEFKARAKYPGEKYEYAVTEARKIWCFGPDGTGPNLLMDVTKGVQ YLNEIKDS 672 

Qy 478 IVQG 481 

: f I 

Db 673 WAG 676 



RESULT 10 
T14579 

translation elongation factor eEF-2 - beet 
C; Species: Beta vulgaris (beet) 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 20-Jun-2000 
C; Accession: T14 57 9 
R;Vogel, R. 

submitted to the EMBL Data Library, June 1997 
A; Reference number: Z 18 14 8 
A; Accession : T14 57 9 

A; Status : preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: mRNA 

A; Residues: 1-843 <VOG> 

A; Cross-ref erences : EMBL : Z 97 17 8 

A; Experimental source: root 

C; Superf amily : translation elongation factor 2; translation elongation factor Tu 
homology 

C; Keywords: GTP binding; phosphoprotein 

F; 2 0-1 61 /Domain : translation elongation factor Tu homology <ETU> 

Query Match 32.3%; Score 816.5; DB 2; Length 843; 

Best Local Similarity 35.7%; Pred. No. 2.1e-52; 

Matches 175; Conservative 111; Mismatches 17 9; Indels 25; Gaps 8; 

Qy 9 NLILS PLLGN VCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRL 55 

1 = 1:: MM: III: : II I : | | | : ( | || : : : Ml 

Db 186 NVIMATYEDPLLGDVQVYPEKGTVAFSAGLHGWAFTLSNFAKMYASKFG-VDESKMMERL 244 

Qy 56 WGDIYFNPKTRKF-TKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIH 114 

I M : I : I I : I : I I : : I : | | | : I ||: :|:| : | | : : II I 

Db 245 WGENFFDPATKKWTTKNSGNASCKRGFVQFCYEPIKQIIAACMNDQKDKLLAHVTKLGIQ 304 



Qy 115 LTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLG 174 

: II I | | | : : | : : : : : I : I : I I I : : : I : I I : j 

Db 305 MKTEEKDLMGRPLMKRVMQTWLPASSALLEMMIHHLPSPATAQRYRVENLYEGPMDDVYA 364 

Qy 175 EAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDS 234 

I : : I I I : I I I I : : II I : I I I M : I : I I : : : I I I : : : | 

Db 365 TAIRNCDPEGPLMLYVSKMIPASDKGRFFAFGRVFAGKVSTGMKVRIMGPNYVPGEKKDL 424 

Qy 235 Q I CT VG RLW I S VARYH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E EAQ I F R P L K F 2 94 

: I I I : : I I I I I I : I : I I I I I I : I : : I I : I I 

Db 425 YVKNVQRTVIWMGKKQETVEDVPCGNTV7VLVGLDQYITKNATLTNEK— ESDAHPIRAMKF 483 

Qy 295 NTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMH 354 

: : I : : : I I : I : I I I : : : I I : : : II I : : I I I II I : I I I I I : I : : 

Db 484 SVSPWRVAVQCKVASDLPKLVEGLKRLAKSDPMWCSIEESGEHIIAGAGELHLEICLK 543 

Qy 355 DLRKMY-SEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II: : : I : M I I : I I I I : : i : : : I I I I : : 1 I I : I : I I I I I : 

Db 544 DLQDDFMGGAEI I KSDPWS FRETVLDRS VRTVMSKS PNKHNRLYMEARPMEEGLAEAI D 603 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — L 471 

: : ! : : I I I I : I I I I I : I I I I : : I I : I : I 

Db 604 EGRI GPRDDPKNRS KI LAEEYGWDKDLAKKIWCFGPETTGPNMVVD MCKGVQYL 657 

Qy 4 72 GSVKDSIVQG 481 

: I I j : I I 
Db 65 8 N E I KD SWAG 667 



RESULT 11 
A41778 

translation elongation factor eEF-2 - yeast ( Saccharomyces cerevisiae) 

N; Alternate names: protein D9481.22; protein 03317; protein YDR385w; protein 

YOR133w; protein YOR3317w 

C; Species: Saccharomyces cerevisiae 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 02-Feb-2001 
C;Accession: A41778; S60995; S61689; S67018; S61180; S69669; S63872 
R; Perentesis , J.P.; Phan, L.D.; Gleason, W.B.; LaPorte f D.C.; Livingston, D.M.; 
Bodley, J.W. 

J. Biol. Chem. 267, 1190-1197, 1992 

A; Title: Saccharomyces cerevisiae elongation factor 2. Genetic cloning, 

characterization of expression, and G-domain modeling. 

A; Reference number: A41778; MUID : 92112 7 60 ; PMID: 1730643 

A; Access ion: A417 7 8 

A;Molecule type : DNA 

A; Residues: 1-842 <PER> 

A; Cross-references : EMBL:M593 69; NID: g4 15721; P I DN : AAA2 164 6.1; PID: g549849 
A;Note: sequence extracted from NCBI backbone (NCBIN : 7 6324 , NCBIP:76328) 
R;Van Ness, B.G.; Howard, J.B.; Bodley, J.W. 
J. Biol. Chem. 255, 10717-10720, 1980 

A;Title: ADP-ribosylation of elongation factor 2 by diphtheria toxin. Isolation 

and properties of the novel ribosyl-amino acid and its hydrolysis products. 

A;Reference number: A44720; MUID : 81046928 ; PMID:7000782 

A; Contents: annotation; identification and isolation of dipthamide 

R;Wiemann, S.; Rechmann, S.; Benes, V.; Voss, H.; Schwager, C; Vlcek, C . ; 

Stegemann, J. ; Zimmermann, J.; Erfle, H. ; Paces, V.; Ansorge, W. 



submitted to the EMBL Data Library, August 1995 

A; Description : Sequencing of 51 kilobases on the right arm of chromosome XV from 

S. cerevisiae reveals 30 open reading frames. 

A;Reference number: S60983 

A;Accession: S60995 

A; Molecule type: DNA 

A; Residues: 1-842 <WIE> 

A; Cross-references: EMBL : X90518 ; NID: gl 0508 08 ; PIDN : CAA62 116 . 1; PID:gl050821 
A; Genetics: EFT1 

R;Benes, V.; Andrade, M.A. ; Rechmann, S.; Teodoru, C; Banrevi, A.; Sander, C; 
Valencia, A.; Ansorge, W.; Voss, H. 

submitted to the EMBL Data Library, December 1995 

A; Description : Nucleotide sequence and analysis of a 130 kb fragment of yeast 

chromosome XV. 

A; Reference number: S61643 

A;Accession: S61689 

A; Molecule type: DNA 

A; Residues: 1-842 <BEN> 

A; Cross-references : EMBL:X94 33 5; NID : gl2 62 13 9 ; PIDN : CAA64 052 . 1; PID: gl 164 977 
A; Genetics: EFT1 

R;Voss, H.; Benes, V.; Rechmann, S.; Teodoru, C; Schwager, C; Paces, V.; 
Ansorge, W. 

submitted to the Protein Sequence Database, July 1996 
A; Reference number: S66965 
A;Accession: S67018 
A; Molecule type: DNA 
A; Residues: 1-842 <VOS> 

A; Cross-references: EMBL:Z75041; MIPS : YORl33w; NID : gl42 034 1 ; PIDN : CAA9 9332 . 1 ; 
PID:gl420342 

A; Experimental source: strain S288C 
A; Genetics : EFTl 
R;Ding, H. 

submitted to the EMBL Data Library, June 1995 

A; Description : The sequence of S. cerevisiae cosmid 9481. 

A; Reference number: S61159 

A;Accession: S61180 

A; Molecule type: DNA 

A; Residues: 1-515 <DIN> 

A; Cross-references : EMBL:U2 8373; NID: g8 4 9184; PIDN : AAB64 82 1 . 1; PID: g8 4 92 06 
A; Experimental source: strain S28 8C (AB972) 
A; Genetics: EFT2 
R; Dietrich, F.S. 

submitted to the EMBL Data Library, July 1995 

A; Description : The sequence of S. cerevisiae cosmids 9481, 9509, 9926, 9461, and 
lambda 3641. 

A; Reference number: S69665 
A;Accession: S69669 
A; Molecule type: DNA 
A; Residues: 1-842 <DIE> 

A; Cross-references: EMBL:U32274; NID : g927313 ; PIDN : AAB64 827 . 1 ; PID:g927318 
A; Genetics : EFT 2 

R;Wiemann, S.; Rechmann, S.; Benes, V.; Voss, H.; Schwager, C . ; Vlcek, C; 
Stegemann, J.; Zimmermann, J.; Erfle, H.; Paces, V.; Ansorge, W. 
Yeast 12, 281-288, 1996 

A; Title: Sequencing and analysis of 51 kb on the right arm of chromosome XV from 
Saccharomyces cerevisiae reveals 30 open reading frames . 
A; Reference number: S63860; MUID : 97 06002 0 ; PMID: 8904341 



A; Accession : S63872 

A; Status : nucleic acid sequence not shown; translation not shown 
A; Molecule type: DNA 
A; Residues: 1-842 <WIW> 

A; Cross-references : EMBL:X90518; NID : glOSO 8 0 8 ; PIDN : CAA62 116 . 1 ; PID:gl050821 
A;Note: the nucleotide sequence was submitted to the EMBL Data Library, August 
1995 

C; Genetics : <EFT1> 
A; Gene: SGD : EFT1 

A; Cross -references : MIPS : YORl33w; SGD: SO 005 659 

A;Map position: 15R 

A;Note: YORl33w 

C; Genetics : <EFT2> 

A; Gene: SGD : EFT2 

A; Cross-references : MIPS : YDR3 85w; SGD:S0002793 
A; Map position : 4R 
A; Note: YDR3 8 5w 

C; Superf amily : translation elongation factor 2; translation elongation factor Tu 
homology 

C; Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; protein 
biosynthesis 

F; 2 0-1 61 /Domain : translation elongation factor Tu homology <ETU> 
F; 2 6-33/ Region : nucleotide-binding motif A (P-loop) 
F; 158-161/Region: GTP-binding NKXD motif 

F; 699/Modif led site : 2 1 - [3-carboxamido-3- ( trimethylammonio ) propyl] histidine 
(His) #status predicted 

Query Match 32.0%; Score 807.5; DB 2; Length 842; 

Best Local Similarity 35.7%; Pred. No. le-51; 

Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps 5; 



Qy 


14 


PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNPKTRKFTKK — 

I llll : II: 11 It II:: : 1 II 1 1 : 1 1 1 1 t : 1 : 1 1 
PARGTVAFGSGLHGWAFTIRQFATRYAKKFG-VDKAKMMDRLWGDSFFNPKTKKWTNKDT 


71 


Db 


204 


262 


Qy 


72 


-APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 


130 


Db 


263 


| : | : | ! | | : | : : : : : : : | 1 : : 1 1 i : 1 1 = 1 1 = =r 
DAEGKPLERAFNMFILDPIFRLFTAIMNFKKDEIPVLLEKLEIVLKGDEKDLEGKALLKV 


322 


Qy 


131 


VCKKFFGEFTGFVDMCVQHI PSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 

I : I I : : | | | : M 1 : i II 1 1 : : 1 1 1 1 1 : 
A/MRKFLPAADALLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIAIPCNCDPKADLMLYV 


190 


Db 


323 


382 


Qy 


191 


TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 

: I I || : | : 1 1 1 1 : 1 1 : : 1 1 1 : : 1 1 1 : : 1 1 : 1 : : : 1 : 

SKMVPTSDKGRFYAFGRVFAGTVKSGQKVRIQGPNYVPGKKDDLFIKAIQRWLMMGRFV 


250 


Db 


383 


442 


Qy 


251 


IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 

: : M | | : : | : | I : : | | | : 1 : 1 ! : : I 1 : : 1 : : : M 1 1 : 
EPIDDCPAGNI IGLVGIDQFLLKTGTLTT S ETAHNMKVMKFSVS PWQVAVEVKNAN 


310 


Db 


443 


499 


Qy 


311 


ELPKMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 

: I I I : : : 1 1 : : : : 1 1 1 : 1 : Mill:: | | i | I : I : : I I : : : : I : : 1 

DLPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 


370 


Db 


500 


559 


Qy 


371 


WTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 


430 



II : III 



Db 



560 WAYRETVESESSQTALSKSPNKHNRIYLKAEPIDEEVS LAI ENGI INPRDDFKARARIM 61 



Qy 431 QT K YDWD LLAARS I WAFG P DAT GPN I LVD DT LPS EVD KAL — LGSVKDSIV 479 

IN: I I I I I I I I I I I : : : I I I I : I : I 1 I : I 

Db 620 ADDYGWDVT DARKIWCFGPDGNGPNLVI DQT KAVQYLHEIKDSW 664 



RESULT 12 
T41697 

translation elongation factor eEF-2 [similarity] - fission yeast 
( Schizosaccharomyces pombe) 
C; Species: Schizosaccharomyces pombe 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 03-Nov-2000 
C;Accession: T41697; T39256; T38896; T51993; T51994; T42427 

R;Wedler, H.; Dues terhoef t , A.; McDougall, R.C.; Rajandream, M.A. ; Barrell, B 

submitted to the EMBL Data Library, October 1999 

A;Reference number: Z22010 

A; Accession : T41697 

A; Molecule type: DNA 

A; Residues: 1-842 <WED> 

A;Cross-references : EMBL : AL12 185 9 ; P I DN : CAB 5 8373.1; GSPDB : GN00068 ; 
SPDB:SPCP31B10. 07 

A; Experimental source: strain 972h-; clone pi p3lBlO 
A; Genetics: SP2 

R;Stevens, K. ; Churcher, C.; McDougall, R. ; Rajandream, M.A. ; Barrell, B.G. 

submitted to the EMBL Data Library, August 1999 

A; Reference number: Z21838 

A; Accession: T39256 

A;Molecule type: DNA 

A;Residues: 31-842 <STE> 

A;Cross-references: EMBL : AL109734 ; PIDN : CAB52 14 7 . 1 ; GSPDB : GN00 066 ; 
SPDB: SPAPYUK7 1 . 04c 

A; Experimental source: strain 972h-; cosmid pYUK71 
A; Genetics: SP1 

R;Seeger, K.; Harris, D.; McDougall, R.C.; Rajandream, M.A. ; Barrell, B.G. 

submitted to the EMBL Data Library, October 1999 

A;Reference number: Z21816 

A; Accession: T38896 

A;Molecule type: DNA 

A; Residues: 1-63 <SEE> 

A;Cross-references : EMBL : AL122032 ; PIDN : CAB5 872 4 . 1 ; GSPDB : GN000 66 ; 
SPDB: SPAC513 . 01c 

A; Experimental source: strain 972h-; cosmid c513 
A; Genetics: SPl 

R;Mita, K.; Morimyo, M. ; Ito, K. ; Sugaya, K. ; Ebihara, K. ; Hongo, E.; Higashi 
T.; Hirayama, Y. ; Nakamura, Y. 
Gene 187, 259-266, 1997 

A; Title: Comprehensive cloning of Schizosaccharomyces pombe genes encoding 
translation elongation factors. 

A;Reference number: Z22377; MUID : 972544 80 ; PMID:9099890 
A; Accession: T51993 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A; Residues: 1-820, 1 D VG 1 , 824-842 <MIT1> 
A; Cross-references : EMBL: D83 975; PIDN : BAA2 35 90 . 1 
A; Experimental source: strain JY333, gene EF-2 . 1 
A; Accession: T51994 



A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: DNA 

A;Residues: 1-82 0 , ' DVG ' , 82 4-842 <MIT2> 

A; Cross-references : EMBL : D8397 6; PIDN : BAA235 91 . 1 

A; Experimental source: strain JY333, gene EF-2 . 2 

R; Yoshioka, S.; Kato, K. ; Nakai, K.; Okayama, H.; Nojima, H. 

DNA Res. 4, 363-369, 1997 

A; Title: Identification of open reading frames in Schizosaccharomyces pombe 
cDNAs . 

A; Reference number: Z17323; MUID : 98162722 ; PMID: 9501991 
A; Accession: T42427 

A; Status : preliminary; translated from GB/EMBL/DDB J 
A; Molecule type: mRNA 

A; Residues: 177-322, 'D' ,324-463, 'A', 465- 

700, ' WWWSNYSYCSSCRLRFYPFGFSHHPGARFLGRNPSFRKRYGWYLLCP ' <YOS> 

A;Cross-references: EMBL:D89151; NID : gl7 4 95 09 ; PIDN : BAA13813 . 1 ; PID:gl749510 
A; Experimental source: strain PR7 45 
C; Genetics: <SP1> 

A; Gene: SPDB : SPAPYUK7 1 . 04c; SPDB: SPAC513 . 01c 

A; Map position : 1 

C; Genetics: <SP2> 

A; Gene: SPDB : SPCP31B10 . 07 

A; Map position : 3 

C; Super family: translation elongation factor 2; translation elongation factor Tu 
homology 

C; Keywords: GTP binding; protein biosynthesis 

F;20-161/Domain: translation elongation factor Tu homology <ETU> 

Query Match 31.7%; Score 801.5; DB 2; Length 842; 

Best Local Similarity 36.2%; Pred. No. 2.8e-51; 

Matches 170; Conservative 105; Mismatches 183; Indels 11; Gaps 4; 



Qy 


14 


PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKKAP 

t 111:1 : II: II 1 1 Mi: : : I I I I : I I 1 1 1 1 : 1 : : 1 1 
PDKGTVAFASGLHGWAFTVRQFANRYAKKFG-IDRNKMMQRLWGENYFNPKTKKWSKSAT 


73 


Db 


204 


262 


Qy 


74 


T SSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 

: 1 : 1 1 : 1 1 1 1 : 1 : 1 : 1 1 : : 1 : 1 : : : 1 : 1 : 1 1 : : 
DANGNSNQRAFNMFILDPI YRIFDAVMNSRKDEVFTLLSKLEVTIKPDEKELEGKALLKV 


130 


Db 


263 


322 


Qy 


131 


VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 

I : M : : 1 1 1 : M 1 1 : : 1 1 ! : I : : : | | : 1 I I : 

VMRKFLPAADALMEMIVLHLPSPKTAQQYRAETLYEGPMDDECAVGIRNCDANAPLMIYV 


190 


Db 


323 


382 


Qy 


191 


TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 

: I I |[ : 1 : 1 1 1 1 1 1 1 : : 1 I : : 1 II : : 1 1 : 1 : : 

SKMVPTSDRGRFYAFGRVFSGTVRSGLKVRIQGPNYVPGKKDDLFIKAIQRTVLMMGSRI 


250 


Db 


383 


442 


Qy 


251 


IEVNRVPAGNWLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPS 

: I I 1 I : : 1 1 1 1 : 1 t : 1 : 1 : 1 1 : : I I : : 1 : : : 1 II 1 : 

E P I ED C P AGNI I GLVGVDQ FLVKS GT LT T SEVAHNMKVMKFSVS PWQVAVEVKNGN 


310 


Db 


443 


499 


Qy 


311 


ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 

: I I I : : : I I : : : : II 1 : Mill:: | || 1 : 1 : : M : : : : 1 = 1 : : 1 

DLPKLVEGLKRLSKSDPCVLCTTSESGEHIVAGAGELHLEICLKDLQEDHAGIPLKISPP 


370 


Db 


500 


559 


Qy 


371 


WTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 


430 



Db 560 WSYRESVSEPSSMTALSKSPNKHNRimTAEPMSEEisVAlETGHWPRDDFivRARIM 619 

QY 431 QTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIV 479 

: : I I : I I I I I I I I I | | :: | | | i i . i , , . , 

Db 620 ADEFGWDVTDARKIWCFGPDTTGANVVVDQTKA VAYLNEIKDSVV 664 

RESULT 13 
A96602 

C, Species: Arabidopsis thaliana (mouse-ear cress) 

cjAc^ssion^^O^ 1 # "^ n ~-«^°n 02-Mar-2001 #text_chan g e 23-Mar-2001 

AioL e ^Tr^af EC ^'Ar J ;uio TV^'" *?*"*^> »■*■> S.; White, O. 

Char, a . C raU] °' R * ; Bowman, C.L.; Brooks, S.Y.; Buehler E * 

Haas, B.; Hansen, N.F.; Hughes, B. ; Huizar L ' ' G ° ldsmlth - A.D.; 

Nature 408, 816-820, 2000 

Hooper, 3., Lea, A.; Le„, j.m.; L«n CA Li' S'h'^ ^ C J • ' B -'' Langl " 

Rizzo, M. ; Rooney, T. ; Rowley, D . ; Sa kano ,' H '. ' ' *^ &rS ™< J " ' ^ 

XSon^r/" 1 ?^' S - L -'' Schwart2 ' J -K"- Shinn, P.; Southwick, A.M. ; Sun H • 

J;^" e: Se quence and analysis of chromosome 1 of the plant Arabidopsis 
A;Reference number: A86141; MUID : 21016719; PMID • 11130712 Ara *> lclo P sl s. 
A;Accession: A96602 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-846 <STO> 

cjSne^csf^ 110631 GB:AE005173; NID:g6056373; PIDN : AA.F02 837 . 1 ; GSPDB : GN0014 1 
A;Gene: T6H22 . 13 
A; Map position: 1 

C;Superfamily: translation elongation factor f ratte i af ■ 

homology ngarion factor 2, translation elongation factor Tu 

Kt^aTs^ilarity l\\\V ; £« S^'uS**' 

"* tdtl * S 172 '' C — ative 115; Mismatches 178; Indels 25; Gaps 8; 

QY 9 ^? VCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRL 55 

Db 1 89 NVIMATYEDPLLGDVQVYPEKGTVAFSAGLHGWAFTLTNFAKMYA 247 

Qy 

Db 

Qy 



56 WGDIYFNPKTRKFT-KKAPTSSSQRSFVEFILEPLYKILAQVVGDVDTSLPRTLDELGIH 114 
248 WGENFFDPATRKWS GKNTGS PTCKRGFVQFC YEP I KQI I ATCMNDQKDKLWPMLAKLGVS 307 
115 LTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLG 174 



Db 


308 


MKNDEKELMGKPLMKRVMQTWLPASTALLEMMIFHLPSPHTAQRYRVENLYEGPLDDQYA 


367 


Qy 


175 


EAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDS 

1 : : II 1 : 1 i 1 1 : : I 1 1 : I 1 1 1 1 : 1 : 1 1 : : : 1 1 1 : : : ! 
NAIRN CD PNGPLMLYVSKMI PAS DKGRFFAFGRVFAGKVSTGMKVRIMGPNYIPGEKKDL 


234 


Db 


368 


427 


Qy 


235 


QICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQI FRPLKF 

: 1 1 I : : I 1 I 1 1 1 : I : 1 1 1 1 II : 1 : : 1 1 : 1 1 
YTKSVQRTVIWMGKRQETVEDVPCGNTVAMVGLDQFITKNATLTNEK-EVDAHPIRAMKF 


294 


Db 


428 


486 


Qy 


295 


NTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMH 

: : 1 : : : ! i : 1 : I I 1 : : : I 1 : : : II 1 : : 1 1 1 1 II : : 1 1 II : 1 : : 
SVSPWRVAVQCKVASDLPKLVEGLKRLAKSDPMVVCTMEESGEHIVAGAGELHLEICLK 


354 


Db 


487 


546 


Qy 


355 


DLRKMY-SEIDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMI7VEPLEKGLAEDIE 

II: : : 1 : 1 1 1 1 : 1 1 1 1 : 1 : : : : 1 1 1 1 : : 1 1 1 : 1 : 1 1 1 1 1 : 
DLQDD FMGGAEI I KS D PWS FRETVCDRSTRTVMS KS PNKHNRLYMEARPMEEGLAEAI D 


413 


Db 


547 


606 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — L 
: : : I : : : 1 1 1 : 1 1 1 1 1 1 : I 1 1 1 : : 1 1 : I : i 
DGRIGPRDDPKIRSKILAEEFGWDKDLAKKIWAFGPETTGPNMWD MCKGVQYL 


471 


Db 


607 


660 


Qy 


472 


GSVKDS I VQG 4 81 

: 1 1 1 : 1 1 
N E I KD SWAG 67 0 




Db 


661 





RESULT 14 
G90128 

elongation factor EF-2 [imported] - Guillardia theta nucleomorph 
C; Species: nucleomorph Guillardia theta 

A; Note: a nucleomorph is the vestigial nucleus of a eukaryotic endosymbiont 
C;Date: 10-May-2001 #sequence_revision 10-May-2001 #text_change 15-Jun-2001 
C; Access ion: G9012 8 

R;Douglas, S.; Zauner, S.; Fraunholz, M. ; Beaton, M. ; Penny, S.; Deng, L.T.; Wu, 
X.; Reith, M. ; Cavalier-Smith, T . ; Maier, U.G. 
Nature 410, 1091-1096, 2001 

A; Title: The highly reduced genome of an enslaved algal nucleus . 
A; Reference number: A99082; MUID : 11323671 ; PMID : 1132 367 1 
A;Accession: G90128 
A; Status: preliminary 
A; Molecule type: DNA 
A; Residues: 1-84 8 <DOU> 

A; Cross-references : GB:AF083031; NID : gl37 94345 ; PIDN : AAK3 9722 . 1 ; GSPDB : GN00 152 

C; Genetics : 

A; Gene : EF2 

A;Map position: 3 

A; Genome: nucleomorph 

C; Super family : translation elongation factor 2; translation elongation factor Tu 
homology 

C;Keywords: nucleomorph 

Query Match 31.1%; Score 787; DB 2; Length 84 8; 

Best Local Similarity 34.0%; Pred. No. 3.4e-50; 

Matches 174; Conservative 118; Mismatches 176; Indels 44; Gaps 11; 



Qy 1 RAYYSTDEN LILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF — 43 

1:1 M : : I I : I I I I I : : III i I : : I I : 

Db 17 4 RTFYKVIENVNVIMVTYQDDLLGDVQVSPEVGNVAFSAGLHGWAFTLSQFARMYAKKWKI 233 

Qy 44 -GDINYQEFAK RLWGDIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWG 98 

I : I I I : I I I I : I : I : : I : I I I : I : I I I I I I I : I [ : : 

Db 234 EGE-KLNEFTKKLTLKLWGDNFFDPSSKKWTKK-EDNSIERSFSHFILNPIKKIINYAMS 291 

Qy 99 DVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAK 158 

I : : : : I : I : : I : | : : | : | : : I : I : I I I 

Db 292 DKVDEI VKAMEGFS I KI TNDDKS LKQKNLMKRVLQKWLPADQALKEMI IHHLPS PARAQF 351 

Qy 15 9 PKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQP 218 

: : : I I = : : I : : I I : I I I I : : I I : | : | I I I I MM M 
Db 352 YRVGNLYEGPLNDEFAESIKKCDSNGPLMVYISKMIPSSDKGRFIAFGRVFSGTIKTGQK 411 

Qy 219 VKVLGENYTLEDEEDSQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATIT 278 

II : I I I I : : : I :llll I I |:|| :||: ||: 

Db 412 I RIMGSNYI PGKKTDLS I KNVQRVLLMMGRKVEI I DNLPAGNTVGI VGI DQCLVKS GTI S 471 

Qy 279 EPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSELPKMLDGLRKVNKS YPSLTTKVEESGE 338 

= : I I : M : : : M : : M M I M M M I : : M M : : M I I : I , I I I 

Db 472 D SENAYPMKNMKYSVSAWRVAVEPKNPSDLPKLIEGLKRLSKSDPLVQCMTEESGE 528 

QY 339 HVILGTGELYLDCVMHDLRKMY-SEIDIKVADPWTFCETV VETSSLKCFAETP 391 

I : : I I I I : I : : M : : : : : | : : I : I : : Ml II: I : : : | 

Db 52 9 HIVAGAGELHLEICLKDLQDDFMNGAELKISQPIVSYRETVEGVINPEETA— VCLSKSP 58 6 

QY 392 NKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDA 451 

I I I : I I I M : I I : I i : : : : : | : : | s | : : I I I I || 

Db 587 NKHNRI YCYAEPLPEGLSEAIDDGKIKSSDEPKTRIKELKQKFDMNEEDIKKIWCFGPDG 646 

Qy 452 TGPNILVDDTLPSEVDKAL — LGSVKDSIVQG 481 

I M II I I II : MM I 

Db 647 NGPNFLVDRT KGLQYINEIKDSCNSG 672 



RESULT 15 
S32819 

translation elongation factor eEF-2 - Chlorella kessleri 
C; Species: Chlorella kessleri 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 02-Feb-2001 

C; Accession: S32 819 

R; Schnelboegl, G. ; Tanner, W. 

Plant Physiol. 97, 469-471, 1991 

A;Title: Plant gene register. Amino acid sequence of an algal peptide elongation 

factor EF-2 deduced from the complementary DNA sequence. 

A;Reference number: S32819 

A; Accession: S32 819 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-845 <SCH> 

A; Cross -references : EMBL:M6 8 064; NID:gl6724 4; PIDN: AAA3 302 8 . 1; PID: gl6724 5 

C; Super family: translation elongation factor 2; translation elongation factor Tu 

homology 

C; Keywords: diphthamide; GTP binding; nucleotide binding; P-loop; phosphoprotein 
F;20-161/Domain: translation elongation factor Tu homology <ETU> 



F; 2 6-3 3 /Region : nucleotide-binding motif A (P-loop) 
F; 158-161/Region: GTP-binding NKXD motif 

F; 702/Modif ied site : 2 1 - [ 3-carboxamido-3- ( t rimethylammonio ) propyl] histidine 
(His) tstatus predicted 

Query Match 30.8%; Score 779; DB 2; Length 845; 

Best Local Similarity 36.0%; Pred. No. 1.3e-49; 

Matches 169; Conservative 103; Mismatches 188; Indels 10; Gaps 6 

Qy 14 PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK— A 72 

I I I I I : : ill I I : I I II : : : : I I M : 1 : I I I : I I I 
Db 2 04 PEAGTVSFSAGLHGWAFTLTVFANMYAAKFG-TDTKRMMEKLWGDNFFDATTRKWTKKHT 262 

Qy 73 PTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIH — LTKEELKLNIRPLLRL 130 

: : I I : I I I I : : : : I I I : I : : I I : : I : M : : 

Db 2 63 GADTCKRGFCQFI YEP I KTVX EAAMNDNKDKLFDLLKKLNVYSKLKPEDRELMGKPLMKR 322 

Qy 131 VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 190 

I : : : : | : | : | | | | : : : I I : I I : : I I I i I I I : 

Db 323 VMQTWLPAHEALLEMMIWHLPSPAKAQKYRVDVLYEGPLDDTYATAVRNCDADGPLMMYV 382 

Qy 191 TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 250 

: I I I : I : I I I I I I I I : t : : : I i I : : I : I I I : : I 

Db 383 SKMIPAADKGRFYAFGRVFSGRIATGRKVRIMGPNYVPGQKKDLYVKTVQRTVLCMGRRQ 442 

Qy 251 I EVNRVPAGNWVLI EGVDQ PI VKTAT I TE PRGNEEAQI FRPLKFNTT S VI KI AVE PVN P S 310 

I I I M I : t : II I I I I : I : : I : I : : II : : I : : : M i I I 
Db 443 EAVE DVP C GNT VALVG L DQ F I T KNAT LTDEK-CE DAHT I KAMK F S VS P WRVAVE P KVAS 501 

Qy 311 ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY-SEIDIKVAD 369 

: I I I : : : I I : : : II I : : I I : I I I : I I I I I : I : : I I : : : I : I : : 

Db 502 DLPKLVEGLKRLAKSDPMVQCTIEETGEHIIAGAGELHLEICLKDLQDDFMGGAEIRVSE 561 

Qy 37 0 PVWFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEF 429 

I I I : I I I I : i I : : : I I I I : : i I I : I I I I I I : : : I : 

Db 5 62 PWS FRETVI GT SDHWMSKS PNKHNRLYMQARPMEDGLAEAI DEGKI GPRDDPKVRSKI 621 

Qy 430 FQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIV 479 

: : I I I : I I I I I I I t I I :: I I : I : ! I I : I 

Db 622 LSEEFGWDKEL7VKKILAFGPDTTGPNMVTDITKGVQ YLNEIKDSW 667 



Search completed: January 30, 2004, 11:27:00 
Job time : 27.2023 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: January 30, 2004, 11:26:28 ; Search time 54.2763 Seconds 

(without alignments) 
1841.751 Million cell updates/se 



Title: US-09-98 9-4 81-3 

Perfect score: 2527 

Sequence: 1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDSIVQG 481 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 789580 seqs, 207824079 residues 

Total number of hits satisfying chosen parameters: 789580 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB . pep : * 
2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 
3: /cgn2_6/ptodata/2/pubpaa/US06JNEW_PUB.pep:* 
4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB .pep : * 
6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 
7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 
8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 
9 : /cgn2_6/ptodata/2/pubpaa/US0 9A_PUBCOMB.pep:* 
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13 : /cgn2_6/ptodata/2/pubpaa/USl0A_PUBCOMB.pep: * 
14 : /cgn2_6/ptodata/2/pubpaa/USlOB_PUBCOMB.pep: * 
15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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10 


us 
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9 
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15 US-10-156-7 61-124 53 
15 US-10-15 6-7 61-9338 
12 US- 10-2 8 9-7 62-58 9 
12 US-10-2 64-04 9-23 83 
9 US-09-8 15-242-137 91 
9 US- 09-8 15-2 42-10419 
12 US-10-2 87-2 7 4-425 
15 US-10-15 6-7 61-14353 
9 US-09-8 15-242-11727 
12 US- 10- 104 -0 47-2 952 

9 US-09-8 15-242-1 1326 
12 US-09-8 82-227- 602 

10 US- 09- 7 3 8-62 6-4 721 
12 US-10-2 64-237-18 82 

11 US-09-815-379-6 

12 US-10-3 69-4 93-224 90 
12 US-10-3 69-4 93-2 084 
9 US-09-8 15-242-1364 6 
12 US-10-032-585-7031 



Sequence 212, App 
Sequence 185, App 
Sequence 4 037, Ap 
Sequence 110, App 

Sequence 110, App 
Sequence 85, Appl 
Sequence 85, Appl 
Sequence 3336, Ap 
Sequence 7 630, Ap 
Sequence 3410, Ap 
Sequence 8410, Ap 
Sequence 2, Applx 
Sequence 5238 , Ap 
Sequence 12443, A 
Sequence 12736, A 
Sequence 13233 , A 
Sequence 11858, A 
Sequence 4977, Ap 
Sequence 10481, A 
Sequence 11060, A 
Sequence 2 4 8, App 
Sequence 10363, A 
Sequence 13965, A 
Sequence 1750, Ap 
Sequence 4 055, Ap 
Sequence 11568 , A 
Sequence 12453, A 
Sequence 9 33 8 , Ap 
Sequence 589, App 
Sequence 2 383, Ap 
Sequence 13791, A 
Sequence 10419, A 
Sequence 425 , App 
Sequence 14353, A 
Sequence 11727, A 

Sequence 2952, Ap 
Sequence 11326, A 
Sequence 602, App 
Sequence 4 721, Ap 
Sequence 1882, Ap 
Sequence 6, Appli 
Sequence 22490, A 
Sequence 2 08 4 , Ap 
Sequence 13646, A 
Sequence 7 031, Ap 



ALIGNMENTS 



RESULT 1 

US-10-116-275-272 

; sequence 272, Application US/10116275 
; Publication No. US20030211476A1 
; GENERAL INFORMATION: 

; APPLICANT: Elan Pharmaceutical Technology 



APPLICANT: O'Mahony, Daniel J. 
APPLICANT: Brayden, David 
APPLICANT: Byrne, Daragh 
APPLICANT: Lambkin, Imelda 
APPLICANT: Higgins, Lisa 

TITLE OF INVENTION: Genetic Analysis of Beyer's Patches and M Cells and 
Methods and 

TITLE OF INVENTION: Compositions Targeting Peyer 1 s Patches and M Ceil 

Receptors 

FILE REFERENCE: E1067/20087 

CURRENT APPLICATION NUMBER: US/ 10/116, 275 
CURRENT FILING DATE: 2002-10-04 
NUMBER OF SEQ ID NOS : 349 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 272 
LENGTH: 856 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-116-275-272 

Query Match 34.4%; Score 870; DB 12; Length 856; 

Best Local Similarity 37.3%; Pred. No. 5.9e-78; 

Matches 182; Conservative 95; Mismatches 189; Indels 22; Gaps o; 
Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 



1 : : : I : I I I i I : I I 1 I I : : I 1 ' : : : 1 : 1 1 _ 

2 02 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 2 6 

57 GDI YFNPKT RKFTKKAPTSS SQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

| | | | : | | I : I I : : I : I : | | : | : : I : : : : : : : I I 

2 62 GDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDI 321 

114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 17 3 

| | s : M | : I : : : : I I : I I I t : I II I * 

322 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 381 

17 4 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: 1 | | | I M : : I I II : I : I I I I I I = I I : : : I ' ' ' : ' ' _ 
382 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 441 

234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 2 93 

s : | : : | | : I I I I I : I I I I : I I I I I I II ' : 1 

442 LYLKPIQRTILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITT FEHAHNMRVMK 498 

294 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

| : : | : : : | I I I I : : I I I : : : M : : : I I I : : I I I I I I : I I M I : I : : 

4 99 FS VS P WRVAVEAKN PADLPKLVEGLKRLAKSDPMVQCI I EESGEHI IAGAGELHLEI CL 558 

354 HDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

|| : :: I II :||||:: III I I:: I — IN I" ' ' 1 MINI: 
559 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 618 

Qv 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

| | : : I | : M : I I t I I I I 1 I M I I I I I : I 

Db 619 KGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 674 



Db 

QY 
Db 

Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



Qy 



474 VKDSIVQG 481 



=111=1 I 

Db 675 I KD SWAG 682 

RESULT 2 

US-10-108-605-185 

Sequence 185, Application US/10108605 
Publication No. US 2 0 02 0160934A1 
GENERAL INFORMATION: 
APPLICANT : Broadus , Julie 
APPLICANT : Stam, Lynn 
APPLICANT : Bachmann, Jane 
APPLICANT : Kamdar, Kim 

TITLE OF INVENTION: NUCLEIC ACID SEQUENCES FROM DROSOPHILA MELANOGASTER THAT 
ENCODE 

TITLE OF INVENTION: PROTEINS ESSENTIAL FOR LARVAL VIABILITY AND USES THEREOF 
FILE REFERENCE: 31133B 

CURRENT APPLICATION NUMBER: US/ 10/ 108 , 605 
CURRENT FILING DATE: 2002-03-27 
PRIOR APPLICATION NUMBER: US 09/761,142 
PRIOR FILING DATE: 2001-01-16 
PRIOR APPLICATION NUMBER: US 60/176,418 
PRIOR FILING DATE: 2000-01-14 
NUMBER OF SEQ ID NOS : 3 61 
SOFTWARE: Patentln Ver . 2.1 
SEQ ID NO 185 
LENGTH: 8 44 
TYPE: PRT 

ORGANISM: Drosophila melanogas ter 
US-10-108-605-185 

Query Match 33.5%; Score 845.5; DB 14; Length 844; 

Best Local Similarity 37.2%; Pred. No. 1.7e-75; 

Matches 175; Conservative 103; Mismatches 183; Indels 9; Gaps 4; 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKT RKFTKK 71 

: I I : I I t : III I : : : I : : ! I : : 1 I I I : : I! II : I : : t 
Db 208 VDPSKGSVGFGSGLHGWAFTLKQFSEMYSEKF-KIDWKLMNRLWGENFFNAKTKKW-QK 265 

Qy 72 APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 131 

: : : I I I : I I : I : I I : : : : I : : : ! : I I : : : I I : I 

Db 266 QKEADNKRSFCMYILDPIYKVFDAIMNYKKEEIGTLLEKIGVTLKHEDKDKDGKALLKTV 325 

Qy 132 CKKFFGEFTGFVDMCVQHIPSPKVG7VKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 191 

: : : I I : I I I I I : : I II I : I : I I I I I I I I : : 

Db 326 MRTWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAIAVKSCDPDGPLMMYIS 385 

Qy 192 KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 2 51 

II II : | : | M I : I : II : : : I I I I : I I : I : : I I 

Db 386 KMVPTSDKGRFYAFGRVFAGKVATGQKCRIMGPNYTPGKKEDLYEKAIQRTILMMGRYVE 445 

Qy 252 EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 311 

: I I : I I : I I I I : I I I IN : : I : : M : : I : : : I I I I It:: 

Db 446 AI EDVP S GN I CGLVGVDQFLVKT GT ITT FKDAHNMKVMKFSVSPWRVAVEPKNPAD 502 

Qy 312 LPKMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCAMHDLRKMYSEIDIKVADPV 371 

I M : : : I I : : : II I : : I I M I I : I I I I I : I : : II : : : I : I : M I 



Db 



5 03 LPKLVEGLKRLAKSDPMVQCIIEESGEHI IAGAGELHLEICLKDLEEDHACI PLKKSDPV 5 62 



Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQ 431 

I :: I I I I I | : : : | M | : : | | | : II I I I : I | | : 

Db 563 VS YRETVSEESDQMCLSKSPNKHNRLLMKAL PMPDGLPEDI DNGEVSAKDEFKARARYLS 622 

Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I I : I : I I I I I I I I I I 1 I :: I I : j : | | | : | | 

Db 623 EKYDYDVTEARKIWCFGPDGTGPNFILDCTKSVQ YLNEIKDSWAG 668 

RESULT 3 

US- 10- 12 8-7 14-4 037 

Sequence 4037, Application US/10128714 
Publication No. US20030119013A1 
GENERAL INFORMATION: 
APPLICANT: Jiang, Bo 
APPLICANT: Hu, Wenqi 
APPLICANT: Tishkoff, Daniel 
APPLICANT: Zamudio, Carlos 
APPLICANT: Eroshkin, Alexey M 
APPLICANT: Lemieux, Sebastien M 

TITLE OF INVENTION: Identification of Essential Genes in Aspergillus 
f umigatus and 

TITLE OF INVENTION: Methods of Use 
FILE REFERENCE: 10182-018-999 
CURRENT APPLICATION NUMBER: US/10/12 8,714 
CURRENT FILING DATE: 2002-04-23 
PRIOR APPLICATION NUMBER: US 60/285,697 
PRIOR FILING DATE: 2001-04-23 
PRIOR APPLICATION NUMBER: US 60/287,066 
PRIOR FILING DATE: 2001-04-27 
PRIOR APPLICATION NUMBER: US 60/295,890 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/303,899 
PRIOR FILING DATE: 2001-07-09 
PRIOR APPLICATION NUMBER: US 60/316,362 
PRIOR FILING DATE: 2001-08-31 
NUMBER OF SEQ ID NOS : 8603 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 4037 
LENGTH: 8 84 
TYPE: PRT 

ORGANISM: Aspergillus fumigatus 
US-10-128-714-4037 

Query Match 31.9%; Score 806; DB 15; Length 884; 

Best Local Similarity 36.0%; Pred. No. 1.7e-71; 

Matches 169; Conservative 104; Mismatches 183; Indels 14; Gaps 5; 

QY 14 PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKKAP 73 

I Mil : It: II II II:::: : | | | M I I I I : I : I : I I 
Db 2 01 PDRGTVAFGSGLHGWAFTVRQFAVKYAKKFG-VDRKKMLERLWGDNYFNPQTKKWTKTG- 25 8 

QY 74 TSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCK 133 

I I : I I | I : I : : I I I | I : : : : : | | | : | | : | | : : : : 

Db 259 -EPEQRAFNMFILDPIFKIFAAVNNDKTEEIHKLVEKLEIKLASDEKDLKGKALLKVIMR 317 



Qy 134 KFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTTKM 193 

II : : I I : I M I : t I I : I : : I I I I I I I : : I I 

Db 318 KFLPAADAMLEMICIHLPSPVTAQKYRAETLYEGPMDDECAIGIRDCDPKAPLMLYVSKM 377 

Qy 194 FSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHIEV 253 

II : i : I I I I I I : : I I : : I I I : : I : : I : : I : : 

Db 378 VPTSDKGRFYAFGRVFSGIVKSGLKVRIQGPNYIPGKKDDLFVKAIQRTILMMGRFVEPI 437 

Qy 254 NRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSELP 313 

f I I I I I : I I I I : : I : i : I : I I : : I | : : I : : I I I : I I 

Db 438 EDVPAGNIVGLVGVDQFLLKSGTLTT SETAHNLKVMKFSVSPWQRGVEVKNAQDLP 494 

Qy 314 KMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPVVT 373 

I : : : I I : : : : I I ! : I : I I I : I : : I I M : [ : : II : : : : : : : : I I I I : 
Db 495 KLVEGLKRLSKSDPCVLTMINESGQHIVAGAGELHLEICLKDLEEDHAGVPLRISDPWS 554 

Qy 374 FCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQTK 433 

: M | II: : : : | | | | : : : | : | | : : : | | : : : I 

Db 555 YRETVGGES SMTALSKS PNKHNRLYVTAQPLGEEVS LAI ES GKINPRDDFKARARLLADD 614 

Qy 434 YDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIVQG 481 

III: II II MM II hill I II: I : I I I I I 

Db 615 YGWDVTDARKIWCFGPDTTGANLLVDQT KAVQ YLN E I KD S FVS G 658 



RESULT 4 

US-09-874-923-110 

Sequence 110, Application US/09874923 
Patent No. US20020081320A1 
GENERAL INFORMATION: 
APPLICANT: Reed, Steven G. 
APPLICANT: Campos-Neto, Antonio 
APPLICANT: Webb, John R. 
APPLICANT: Dillon, Davin C. 
APPLICANT: Skeiky, Yasir A.W. 
APPLICANT: Bhatia, Aj ay 
APPLICANT: Coler, Rhea 
APPLICANT: Probst, Peter 
APPLICANT: Brannon, Mark 

TITLE OF INVENTION: LEI SHMANIA ANTIGENS FOR USE IN THE 
TITLE OF INVENTION: THERAPY AND DIAGNOSIS OF LEISHMANIASIS 
FILE REFERENCE: 210121. 420C8 

CURRENT APPLICATION NUMBER: US / 0 9 / 8 7 4 , 92 3 
CURRENT FILING DATE: 2001-06-04 
NUMBER OF SEQ ID NOS : 122 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 110 
LENGTH: 845 
TYPE: PRT 

ORGANISM: Leishmania major 
FEATURE : 

NAME/ KEY: VARIANT 
LOCATION: (1) . . . (845) 

OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-874-923-110 



Query Match 29.4%; Score 743; DB 9; Length 845; 

Best Local Similarity 34.7%; Pred. No. 3.5e-65; 

Matches 166; Conservative 102; Mismatches 189; Indels 22; Gaps 9 



Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNPKTRKFTKK 71 

: | | || | : I : ! I I : i I I I : : : : | | I | | : | : I : I : I : 

Db 200 VSPEKGTVAIGSGLQAWAFSLTRFANMYAAKFG-VDELKMRERLWGDNFFDAKNKKWIKQ 258 

Qy 72 APTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLL 12 8 

: : I : I : I I : I : i : I I : : : : I I : I I I I : M 

Db 259 ETNADGERVRRAFCQFCLDPI YQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQVPXKLL 318 

Qy 129 RLVCKKFFGEFTGFVDMCVQHI PSPKVGAKPKIEHTYTGGVDSD LGEAMSDCDPDG 184 

: I I : I i I : I i i I : I ! : I : : | : : I I I 

Db 319 KTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IKNCDPAA 37 6 

Qy 185 PLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQICTVGRLW 243 

i | | : : II II : I I I I : I I : : I I I : : : I I I : : I : II 

Db 37 7 PLMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDKPVQRSV 43 6 

Qy 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

: : I I I : | | | I : I I I : I I! : I I I I : I | : 1 : : : I : : : I 

Db 437 LMMGRYQEAVEDMPCGNWGLVGVDKYIVKSATITD — D G E S P H P LRDMK Y S VS PWRVA 4 94 

Qy 304 VEPVNPSELPKMLDGLRKVNKS YPSLTTKVEESGEHVILGTGELYLD.CVMHDLRKMY— SE 362 

II | | | : | [ | : : : | | : : : II I : : M I I I I : : I I I I : I : : I I : : : : 

Db 495 VEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQEDFMNG 554 

Qy 3 63 I DI KVADPVVTFCETVVET S S LKCFAETPNKKNKITMI AEPLEKGLAEDI ENEWQITWN 422 

: I : : : I I I : I I I I : I I : I : : : I I I : : I I : I I = I : 

Db 555 APLKISEPWSFRETVTDVSSQQCLSKST^NKHNRLFCRGAPLTEELALAMEEGTAGPEAD 614 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 47 9 

I I I : I I : I I I I : I I I I I I : : I I I I = : : I I I I 

Db 615 PKVRARFLADNYEWDVQEARKIWCYGPDNRGPNVWD VTKGVQNMAEMKDSFV 667 



RESULT 5 

US-09-991-496-110 

Sequence 110, Application US/09991496 
Patent No. US2 002 01 692 8 5A1 
GENERAL INFORMATION: 
APPLICANT: Reed, Steven G. 
APPLICANT: Campos-Neto, Antonio 
APPLICANT: Webb, John R. 
APPLICANT: Dillon, Davin C. 
APPLICANT: Skeiky, Yasir A.W. 
APPLICANT: Bhatia, Ajay 
APPLICANT: Coler, Rhea 
APPLICANT: Probst, Peter 
APPLICANT: Brannon, Mark 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USE IN THE 
TITLE OF INVENTION: THERAPY AND DIAGNOSIS OF LEISHMANIASIS 
FILE REFERENCE: 210121. 420C9 

CURRENT APPLICATION NUMBER: US/09/991, 4 96 
CURRENT FILING DATE: 2001-11-20 
NUMBER OF SEQ ID NOS : 137 



; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID MO 110 
; LENGTH: 845 
; TYPE: PRT 

; ORGANISM: Leishmania major 
FEATURE: 

NAME/ KEY : VARIANT 
LOCATION: 315, 324 

OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-991-496-110 

Query Match 29.4%; Score 743; DB 10; Length 845; 

Best Local Similarity 34.7%; Pred. No. 3.5e-65; 

Matches 166; Conservative 102; Mismatches 189; Indels 22; Gaps 9 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKI Y7VDTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 

: I I || I : I : I i I : I I M : : : : I I M I : I : I : I : I : 
Db 200 VSPEKGTVAIGSGLQAWAFSLTRFANMYAAKFG-VDELKMRERLWGDNFFDAKNKKWIKQ 258 

Qy 72 APTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLL 128 

: : | : | : | | : | : | : | | : : : : I I : I I I I : I I 

Db 259 ETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQVPXKLL 318 

Qy 129 RLVCKKF FGE FT G FVDMCVQH I P S P KVGAKP K I EHT YT GGVD S D LGEAMSDCDPDG 184 

: I I : I I 1 : I II I : I I : I : : I : : M I 

Db 319 KTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IKNCDPAA 376 

Qy 185 PLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQICTVGRLW 243 

II I : : I I II : I | | | : | | : : I I I : : : I I I : : I : II 

Db 377 PLMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDKPVQRSV 436 

Qy 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

: : I I I : | | | | : | | | : j | | : I I I I : I | : | : : : | : : : | 

Db 437 LMMG R YQ EAVE DMP C GNWGLVGVD K Y I VK S AT I T D — DGESPHPLRDMKYSVSPWRVA 4 94 

Qy 304 VE PVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY— SE 362 

II | j | : | | | : : : | | : : : | | 1 : : I M II I : : I I I I : I : : I I : : : : 
Db 4 95 VEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQEDFMNG 554 

Qy 363 IDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

: I : : : I I I : I I M : I I : I : : : I I I : : I I : I I : I : 

Db 555 APLKISEPWSFRETVT DVSSQQCLSKSANKHNRLFCRGAPLTEELALAMEEGTAGPEAD 614 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 479 

I I I : I I : IIIIMII I ! I : : M I I : : : I I ! I 

Db 615 PKVRARFLADNYEWDVQEARKIWCYGPDNRGPNVWD VTKGVQNMAEMKDSFV 667 



RESULT 6 

US-09-874-923-85 

; Sequence 85, Application US/09874923 
; Patent No. US20020081320A1 
; GENERAL INFORMATION: 

APPLICANT: Reed, Steven G. 

APPLICANT: Campos-Neto, Antonio 

APPLICANT: Webb, John R. 
; APPLICANT: Dillon, Davin C. 



APPLICANT: Skeiky, Yasir A.W. 
APPLICANT: Bhatia, Aj ay 
APPLICANT: Coler, Rhea 
APPLICANT: Probst, Peter 
APPLICANT: Brannon, Mark 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USE IN THE 
TITLE OF INVENTION: THERAPY AND DIAGNOSIS OF LEISHMANIASIS 
FILE REFERENCE: 210121. 420C8 

CURRENT APPLICATION NUMBER: US/ 09/874, 923 
CURRENT FILING DATE: 2001-06-04 
NUMBER OF SEQ ID NOS : 122 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 85 
LENGTH: 3 61 
TYPE: PRT 

ORGANISM: Leishmania major 
FEATURE : 

NAME / KEY : VARIANT 
LOCATION: (1) ... (361) 

OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-874-923-85 

Query Match 21.9%; Score 554.5; DB 9; Length 361; 

Best Local Similarity 34.6%; Pred. No. 8.1e-47; 

Matches 122; Conservative 82; Mismatches 136; Indels 13; Gaps 6 

Qy 66 RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

: I : I : : : I : I : i I : I : i : I I : : : : I I : I I I I : 

Db 1 KKWIKQETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQ 60 

Qy 123 NI RPLLRLVCKKFFGEFTGFVDMCVQHI P S PKVGAKPKI EHTYTGGVDSD LGEAMS 17 8 

I I : I f : I I I : I I I I : I I : I : : I : 

Db 61 VPXKLLKTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IK 118 

Qy 17 9 DCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQIC 237 

: I 1 I I I I : : I I I ! : I || | : I | : : I I I : : : I I I : : I : 

Db 119 NCDPAAP LMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDK 178 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 297 

II : : I t t : I f I I : I I I : I II : I I I I : I | : | : : : 

Db 17 9 PVQRSVLMMGRYQEAVEDMPCGNWGLVGVDKYIVKSATITD — DGE S PH P LRDMKYS VS 23 6 

Qy 298 SVIKIAVE PVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVI LGTGELYLDCVMHDLR 357 

[ : : : I I I I I I : I I I : : : I I : : : II I : : I M I M : : I I I I : I : : I I : 

Db 237 PWRVAVEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQ 2 96 

Qy 358 KMY-SEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLA 409 

: : : : [ : : : I I I : I I I I : I i : I : : : [ I I : : I I : I I 

Db 297 EDFMNGAPLKISEPWSFRETVTDVSSQQCLSKSANKHNRLFCRGAPLTEXLA 349 



RESULT 7 

US-09-991-496-85 

; Sequence 85, Application US/09991496 

; Patent No. US20020169285A1 

; GENERAL INFORMATION: 

; APPLICANT: Reed, Steven G. 



APPLICANT: Campos-Neto, Antonio 
APPLICANT: Webb, John R. 
APPLICANT: Dillon, Davin C. 
APPLICANT: Skeiky, Yasir A.W. 
APPLICANT: Bhatia, Aj ay 
APPLICANT : Coler , Rhea 
APPLICANT: Probst, Peter 
APPLICANT: Brannon, Mark 

TITLE OF INVENTION: LEISHMANIA ANTIGENS FOR USE IN THE 
TITLE OF INVENTION: THERAPY AND DIAGNOSIS OF LEISHMANIASIS 
FILE REFERENCE: 210121. 420C9 

CURRENT APPLICATION NUMBER: US/ 09/991,496 
CURRENT FILING DATE: 2001-11-20 
NUMBER OF SEQ ID NOS : 137 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 85 
LENGTH: 361 
TYPE: PRT 

ORGANISM: Leishmania major 
FEATURE : 

NAME /KEY: VARIANT 

LOCATION: 63, 72, 347, 352, 353, 360 
OTHER INFORMATION: Xaa = Any Amino Acid 
US-09-991-496-85 

Query Match 21.9%; Score 554.5; DB 10; Length 361; 

Best Local Similarity 34.6%; Pred. No. 8.1e-47; 

Matches 122; Conservative 82; Mismatches 136; Indels 13; Gaps 6 

Qy 66 RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

: I : I : : : I : I : I I : ! : I : I 1 : : : : I I : I i M : 

Db 1 KKWIKQETNADGERVRRAFCQFCLDPIYQIFDAVMNEKKDKVDKMLKSLHVTLTAEEREQ 60 

Qy 123 NI RPLLRLVCKKFFGEFTGFVDMCVQHI P S PKVGAKPKIEHT YTGGVDSD LGEAMS 178 

I I : [ I : i I I : M i I : I I : I : : I : 

Db 61 VPXKLLKTVMMXFLPAAETLLQMIVAHLPSPKKAQAYRAEMLYSGEASPEDKYFMG — IK 118 

Qy 179 DCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED-SQIC 237 

: I I I I M : : I I I i : I I II : I I : : i i I : : : I I I : : ! : 

Db 119 NCDPAAPLMLYISKMVPTADRGRFFAFGRIFSGKVRSGQKVRIMGNNYVYGKKQDLYEDK 178 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 297 

II : : I I I : I I I I : M I : ! 1 i : I I I I : I | : I : : : 

Db 179 PVQRSVLMMGRYQEAVEDMPCGNWGLVGVDKYIVKSATITD — DGES PHPLRDMKYSVS 236 

Qy 2 98 SVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

I : : : I I I I I I : I I I : : : I I : : : i I I : : ! I I 1 I I : : i I I I : I : : I I : 
Db 237 PWRVAVEAKNPSDLPKLVEGLKRLAKSDPLWCSIEESGEHIVAGAGELHLEICLKDLQ 296 

Qy 358 KMY-SEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLA 409 

: : : : I : : : I II : I I I I : I I : I : : : I I I : : II = 11 

Db 297 EDFMNGAPLKI SEPWS FRETVT DVS SQQCLSKSANKHNRLFCRGAPLTEXLA 349 



RESULT 8 

US- 10-32 0-7 97-3336 

; Sequence 3336, Application US/10320797 



Publication No. US2 0 04 0014 955A1 
GENERAL INFORMATION: 
APPLICANT: Eroshkin, Alexey M. 
APPLICANT: Zamudio, Carlos 

TITLE OF INVENTION: IDENTIFICATION OF ESSENTIAL GENES OF CRYPTOCOCCUS 
NEOFORMANS AND 

TITLE OF INVENTION: METHODS OF USE 
FILE REFERENCE: 10182-021-999 
CURRENT APPLICATION NUMBER: US/ 1 0/ 32 0 , 1 9 7 
CURRENT FILING DATE: 2002-12-16 
PRIOR APPLICATION NUMBER: 60/341,261 
PRIOR FILING DATE: 2001-12-17 
NUMBER OF SEQ ID NOS : 3361 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 3336 
LENGTH: 1068 
TYPE: PRT 

ORGANISM: Cryptococcus neoformans 
US- 10-32 0-7 97-333 6 

Query Match 17.1%; Score 431.5; DB 12; Length 1068; 

Best Local Similarity 22.4%; Pred. No. 1.2e-33; 

Matches 135; Conservative 114; Mismatches 191; Indels 163; Gaps 17 

Qy 7 DENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKTR 66 

If:: : I M I I : I : I I I I I : = I I : I : I I I I I : I I I : 

Db 224 DEDI YFAPDRGNVLFASAIDGWAFRLGKFARLYAEKL-KIKEGNLRRVLWGDWYLDPKTK 282 

Qy 67 KFT — KKAPTSSSQRSFVEFILEPLYKILAQWGDVD-TSLPRTLDELGIHLTKEELK-L 122 

: || : : | | : | : | | : : : : | : : : : : : : I I : I : 1 : 

Db 283 RWGRKKLAGRNLKPMFVQFVLENIWRVYDTVLNEYNPDAVQKIVTALNIRITPRDLRSK 342 

Qy 123 NIRPLLRLVCKKFFGEFTGFVDMCVQHI PSPKVGAKPKIEH TYTGGV — D 170 

: I I I t : : : : i : : I I I : = : = I I * • 

Db 343 DTRNLLNLIMQQWLPLSTATFQSIIEVIPPPPSAQAI RLPYMLHPEKAKAAAASGGLKAE 402 

Qy 171 SDLGEAMSDCD — PDGPLMCHTTKMFSTHDG — VQFHP 2 04 

: : I : : I I : : : : II I : I : : I 

Db 4 03 NELERGLYECDQGEGAEWAYVSKMFAVRKGDLPEYKPKEMTAEEMRARGREERERRAAL 4 62 

Qy 2 05 FGRVLSGT 212 

I I : ! I 

Db 463 VAERQAKGEGLDGQPLPEDLAKPLESLSLENIQPATSEKPAVDDSDSEVLLGFSRIFSST 522 

Qy 213 IHAGQPVKVLGENYTLEDEEDSQI CTVGRLWI SVARYHI EVNRVPAG 259 

: | | : : : I I : I : : : I : I : I I I I 

Db 523 LHRGTSLLAILPKF D S S L P P S H PHN I KHT VP 1 1 AS DL YMMMGREL VS VD S VPAG 576 

Qy 260 NWVLIEGVDQPIVKTATITEPR GNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 311 

: | | :::::: | | : | I I : : : : : : | : | | Mi: 

Db 577 HVC AI GG LN RAVP RS AT LWAP DAK GVE E G FG K EALVN LAGVGVGAN AI VRVAL E P EN PSD 636 

Qy 312 LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 371 

: I I : : I I I : I : : I I : I I I M I I : I I I : I : : I I I : : : : I : : I : 

Db 637 MPKLIRGLRILNQADPCAEYFVQESGEHVIITAGELHLERCLKDLRERFAKCPIQQSAPI 696 



QY 



372 VT FCETWET S S LKCFAET PNKKN KITMIAEPLEKGLAEDIENE 415 



I ! | | I : : II | : | | | : | : : : : 

Db 697 VP FRETAVKAP DM APPKTTGAPRGTINGTVINGLVKFRLRAMPLPEGVETFLLSQ 7 51 

Qy 416 WQIT WNRKKLGEFFQTKY DWDLLAARSIWAFGPDATGPNI 456 

I : I : III: II I I : I : I I I 11 = 

Db 7 52 QGAISKMLVRERDGKEARQLSPEEFWTELERLLNKAGGDW-AGAADRVWSFGPKRVGANL 810 

Qy 457 LVD 459 

I : I 

Db 811 LLD 813 



RESULT 9 

US-10-032-5 8 5-7 630 

Sequence 7630, Application US/10032585 
Publication No. US20030180953A1 
GENERAL INFORMATION: 
APPLICANT: Terry, Roemer D. 
APPLICANT: Bo, Jiang 
APPLICANT: Charles, Boone 
APPLICANT: Howard, Bussey 

TITLE OF INVENTION: Gene Disruption Methodologies for Drug Target Discovery 
FILE REFERENCE: 10182-005-999 
CURRENT APPLICATION NUMBER: US / 1 0/ 032 , 5 8 5 
CURRENT FILING DATE: 2001-12-20 
NUMBER OF SEQ ID NOS : 8000 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 7630 
LENGTH: 1044 
TYPE: PRT 

ORGANISM: Candida albicans 
US- 10-032-58 5-7 63 0 

Query Match 16.1%; Score 406.5; DB 12; Length 1044; 

Best Local Similarity 23.9%; Pred. No. 3.6e-31; 

Matches 146; Conservative 102; Mismatches 199; Indels 165; Gaps 22 

Qy 6 TDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKT 65 

: | I : t : I I ! I : I : I : : : I II I I : I : I : I I I I I I : I 

Db 214 SDEDLYFTPEKNNVI FASAIDGWAFS VNTFAKI YSKKLG-FSQQAXSKTLWGDFYLDMKN 272 

Qy 66 RKFT — KKAPTSSSQRS — FVEFI LEPLYKILAQ-WGDVDTSLPRTLDELGIHLTKEEL 120 

: | || : I : II I I : : : : I : | : : : : I I : I : I 

Db 273 KKIIPGKKLKNNSNSLKPLFVSLILDQVWAVYENCVIERNQDKLEKIIEKLGAKITPRDL 332 

Qy 121 K-LNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKI EHTYTGGVDSD 172 

: : : M I : : : : : : : : I I I I : : I : I I I I : 

Db 333 RSKDYKNLLNLIMSQWIPLSHAILGSVIEYLPSPIVAQRERIDKILDETI YSAVDSESDK 392 

Qy 173 LGEAMSDCD PDGPLMCHTTKMFS THDGVQ 2 01 

: I I : I I I : : : : | : I I I : I 

Db 393 SKLVDPSFVKAMQECDSSHPETHTIAYVSKLLSIPNEDLPKASNAATGGLTADEIQERGR 452 



Qy 

Db 



202 FHP 204 

I 

453 IARELAKKAS EAAALAQEGSKNEDEFAIKPKKDPFEWEFEEDDFENEEDESDANAVEEST 512 



Qy 205 FGRVLSGTIHAGQPVKVLGENYTLEDEEDSQI CTVGRLWI SVARYHI 251 

I I : I I : : I i : I : I I II : I : : : I : 

Db 513 ETIVGFTRI YSGSLSRGQKLTVIGPKYDPSLPRDHQTNFEQITNEVEIKDLFLIMGRELV 572 

Qy 252 EVNRVPAGNWVL I EGVDQP I VKTAT I TE P RGNEEAQI FRP LKFNTT S VIKIAV 304 

: : I i I I I I : I : I : : I I I I I : : I : I I : : I I I I 

Db 573 RMEKVP AGN I VGWGL DNAVL KNAT ICSPLPEDKPYI NLASTSTLIHNKPIMKIAV 628 

Qy 305 EPVNPSELPKMLDGLRKVNKSYPSLTTKV-EESGEHVILGTGELYLDCVMHDLRKMYSE- 362 

I | | | : | | : II : I : 1 I I : I I M : : I I I : I : : i I 

Db 62 9 EPTNPIKLAKLERGLDLLAKADPVLEWYVDDESGELIVCVAGELHLERCLKDLEERFAKG 68 8 

Qy 363 I D I KVADPWT FCETWET S S LKC FAET PNKKN KITMIAEPLEKG 407 

: : I : I I : I I : : I I I : = II 

Db 689 CEVTVKEPVI P FREGLADD KI STNTNNNNDDNEDHELDENEDELADLEFDI SPLPLE 745 

Qy 408 LAED-IENE — WQITWNRKKLGEF FQT KYDWDLLAAR SIWAFG 448 

: : I I I I : : I I : : I III I I : =1 II 

Db 746 VTQFLIENETIIAEIVNNKQDTHEIRNDFIEKFATIIDNSNLATQFPDTKSFINNI ICFG 805 

Qy 44 9 PDATGPNI LVDD 4 60 

I MM : : I 
Db 806 PKRVGPNI FIED 817 



RESULT 10 

US- 10-12 8-7 14-34 10 

; Sequence 3410, Application US/10128714 
; Publication No. US20030119013A1 
; GENERAL INFORMATION: 
; APPLICANT: Jiang, Bo 
; APPLICANT: Hu, Wenqi 

APPLICANT: Tishkoff, Daniel 
; APPLICANT: Zamudio, Carlos 
; APPLICANT: Eroshkin, Alexey M 

APPLICANT: Lemieux, Sebastien M 
; TITLE OF INVENTION: Identification of Essential Genes in Aspergillus 
fumigatus and 

TITLE OF INVENTION: Methods of Use 
FILE REFERENCE: 10182-018-999 
CURRENT APPLICATION NUMBER: US/ 10/ 12 8 ,714 
CURRENT FILING DATE: 2002-04-23 
PRIOR APPLICATION NUMBER: US 60/285,697 
PRIOR FILING DATE: 2001-04-23 
PRIOR APPLICATION NUMBER: US 60/287,066 
PRIOR FILING DATE: 2001-04-27 
PRIOR APPLICATION NUMBER: US 60/295,890 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/303,899 
PRIOR FILING DATE: 2001-07-09 
PRIOR APPLICATION NUMBER: US 60/316,362 
PRIOR FILING DATE: 2001-08-31 
NUMBER OF SEQ ID NOS : 8 603 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 3410 
LENGTH: 1013 
TYPE: PRT 



; ORGANISM: Aspergillus fumigatus 
US- 10- 12 8-7 14-34 10 



Query Match 15.3%; Score 387.5; DB 15; Length 1013; 

Best Local Similarity 23.2%; Pred. No. 2.8e-29; 

Matches 136; Conservative 97; Mismatches 213; Indels 141; Gaps 16; 

Qy 7 DENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKTR 66 

M: : I Mil: II: I 1 : t II I I ! I I I : I I I : 

Db 222 DEEIYFAPEKNNVIFCSAIDGWAFTVRQFAALYERKLG-IKRSILEKVLWGDFYLDPKTK 280 

Qy 67 KF — TKKAPTSSSQRSFVEFILEPLY KILAQWGDVDTSLP RTLDELGI 113 

: | : : | | : : | : : : : | : : : : : : i I : I I 

Db 281 RVLGPKHLKGRALKPMFVQLVLDSIWAAYEATTALLEKITKSLNITI PPYVLRSRDPRNI 340 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGG 168 

: I : I M I : : : : : I I I : : I 

Db 341 MMTLFSMWL PL S T AVLVS VI E YL P S P P AAQAT RL PGLIEGSP GAGF 386 

Qy 169 VDSDLGEAM — SDCDPDGPLMCHTTKMF 194 

II : I I I I I : : : : I I 

Db 387 VDKKVKEAMVAFKTGTDAPWAYVSKMVAIPESELLSSKKRSGATLSADEAREIARKKRE 446 

Qy 195 STHDG-VQFHP FGRVLSGTI 213 

: I I I I I I : III: 

Db 4 47 EIAKMQAEAGGNGNEMDDYARVTSAFEVTTLDDGEEQGEPEDKEDPEHLVGFARLYSGTL 506 

Qy 214 HAGQPVKVLGENYTLED E ED S Q I CT VGRLW I S VARYH I EVNRVP AGNWVL I EGVDQ P 270 

I III : : I : I II I : : : I : MM I M 

Db 507 SVGDEVYVIAPKFS PEHPHAHPEPQKTVTVTDLYLLMGRSLEPLKTVPAGVI FGI GGLAGH 566 

Qy 271 IVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLT 330 

|:| I : I : : I : : : : : : I I" I I : : I I I : M I : : I I 

Db 567 ILKNGTLCSQL — EGSINLAGVS LNAPPI VRVSLEPANPADLNKMVTGLRLLEQSDPCAQ 624 

Qy 331 TKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPVVTFCETWETSSLK 385 

: I I I I I I II li I : I : : | | | : : : : : I : : I : M : I : 

Db 62 5 YEVLPSGEHVILTAGELHLERCIKDLRERFAKCEIQTGQTIVPYRETI VSAPEMAPPKKP 68 4 

Qy 38 6 CFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQTK 433 

I : I : ! : : : II : : : I I I : : Ml 

Db 685 DLGRGCVLAVSPSKQLTVKLRWPLPEAVTDFISKNVGTIKRLQSEKRSAGETKSD.AJCAS 744 

Qy 434 YDWDLLAARS I W AFGPDATGPNI LVDDTLPSEVDKA 4 69 

: : : : : : I II I I I I I I II I I : : : I 

Db 745 NGIFNEEVKEGKELWANWDRITAFGPRRVGPNILVDATWNTCENA 791 



RESULT 11 

US-1 0-12 8-714-8410 

; Sequence 8410, Application US/10128714 

; Publication No. US20030119013A1 

; GENERAL INFORMATION: 

; APPLICANT: Jiang, Bo 

; APPLICANT: Hu, Wenqi 

; APPLICANT: Tishkoff, Daniel 

; APPLICANT: Zamudio, Carlos 



APPLICANT: Eroshkin, Alexey M 
APPLICANT: Lemieux, Sebastien M 

TITLE OF INVENTION: Identification of Essential Genes in Aspergillus 
f umigatus and 

TITLE OF INVENTION: Methods of Use 
FILE REFERENCE: 10182-018-9 99 
CURRENT APPLICATION NUMBER: US/10/128, 714 
CURRENT FILING DATE: 2002-04-23 
PRIOR APPLICATION NUMBER: US 60/285,697 
PRIOR FILING DATE: 2001-04-23 
PRIOR APPLICATION NUMBER: US 60/287,066 
PRIOR FILING DATE: 2001-04-27 
PRIOR APPLICATION NUMBER: US 60/295,890 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/303,899 
PRIOR FILING DATE: 2001-07-09 
PRIOR APPLICATION NUMBER: US 60/316,362 
PRIOR FILING DATE: 2001-08-31 
NUMBER OF SEQ ID NOS : 8603 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8410 
LENGTH: 1087 
TYPE: PRT 

ORGANISM: Aspergillus f umigatus 
US- 10- 12 8-714-8 410 

Query Match 15.0%; Score 379; DB 15; Length 1087; 

Best Local Similarity 21.8%; Pred. No. 2.3e-28; 

Matches 139; Conservative 98; Mismatches 218; Indels 184; Gaps 16; 

DENLILSPLLGNVCFSSSQYSICFTLGSFTVKI YADTFGDINYQEFAKRLWGDIYFNPKTR 6 6 

| i : : I Mil: I I : I I : i II ] I I 1 I t : I 1 I : 

DEEIYFAPEKNNVIFCSAIDGWAFTVRQFAALYERKLG-IKRSILEKVLWGDFYLDPKTK 300 

KF — TKKAPTSSSQRSFVEFILEPLY KI LAQWGDVDT S LP 105 

: | : : | | : : | : : : : | : : : : : : I 



I I I 



Qy 


7 


Db 


242 


Qy 


67 


Db 


301 


Qy 


106 


Db 


361 


Qy 


165 


Db 


407 


Qy 


195 


Db 


467 


Qy 


205 


Db 


527 


Qy 


262 


Db 


587 



-VDSDLGEAM — SDCDPDGPLMCHTTKMF 194 

II : I I I | | : : : : | | 



-STHDG-VQFHP 204 

: I I I I 



I : I I I : I III : : I : I II I : : : I : II 



||: I : I I : I : : I :::::: I I I I :: I I I : I I I 



Qy 322 VNKS YPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVAD PWTFCETWET 381 

: : I I : I I I I I 1 I I I I I : I : : I I I : :: : : I : : I : I I : I 

Db 645 LEQSDPCAQYEVLPSGEHVILTAGELHLERCIKDLRERFAKCEIQTGQTIVPYRETIVSA 704 

Qy 382 SSLK CFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNRKKLGEFF 430 

: | : | : | : : : | | : : : | I [ : : 

Db 705 PEMAPPKKPDLGRGCVLAVSPSKQLTVKLRWPLPEAVTDFISKNVGTIKRLQSEKRSAG 764 

Qy 431 qtk YDWDLLAARSIW — 445 

:|| :: :: : :| 

Db 765 ETKSDAKASNGSLETTQQAESGDASGEAREGSQLSLEDFKKELTRIFNEEVKEGKELWAN 824 

Qy 446 AFGPDATGPNILVDDTLPSEVDKALLGSVKDSI 478 

Ml! I I I ! ! I I I : : : I I I I - 

Db 825 VVDRITAFGPRRVGPNILVDATVVNTCEKCLLDDPKQQL 8 63 



RESULT 12 
US-09-927-738-2 

; Sequence 2, Application US/09927738 
; Patent No. US2 002 007 67 9 9Al 
; GENERAL INFORMATION: 

APPLICANT: Wang, Tongwen 

TITLE OF INVENTION: Compos is tions and Methods of modulating TGF-B Signaling 
FILE REFERENCE: 17633/1082 

CURRENT APPLICATION NUMBER: US/09/927, 738 

CURRENT FILING DATE: 2001-08-10 

PRIOR APPLICATION NUMBER: US 60/119786 

PRIOR FILING DATE: 1999-02-11 
; PRIOR APPLICATION NUMBER: PCT/US 00/ 035 61 
; PRIOR FILING DATE: 2000-02-11 
; NUMBER OF SEQ ID NOS : 28 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 
; LENGTH: 54 3 

TYPE : PRT 
; ORGANISM: Unknown 
; FEATURE : 

OTHER INFORMATION: Clone Si + 28 protein 
US-09-927-738-2 

Query Match 12.9%; Score 325; DB 9; Length 543; 

Best Local Similarity 25.1%; Pred. No. 2e-23; 

Matches 85; Conservative 66; Mismatches 103; Indels 84; Gaps 7 

Qy 2 02 FHPFGRVLSGTIHAGQPVKVLGENYT LEDEEDSQICTVGRLW 2 43 

1 ! I I I I | : : | I i I : I I : I : 

Db 88 FIAFARVFSGVARRGKKI FVLGPKYSPLEFLRRVPLCFSAPPDGLPQVPHMAYCALENLY 147 

Qy 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

: : I : I I I I : I I : : : | : I 1 : I I I I I : : : : I 

Db 148 LLMGRELEYLEEVPPGNVLGIGGLQDFVLKSATLC SLPSCPPFI PLNFEATPIVRVA 204 

Qy 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 363 

I I I : I I I : I : : : I : : : I : : \ : : : I : i ! I i : : I I : : I : | | : : : : : I 
Db 2 05 VEPKHPSEMPQLVKGMKLLNQADPCVQILIQETGEHVLVTAGEVHLQRCLDDLKERFAKI 2 64 



Qy 364 D I KVAD PWT FC ET WET SSLKC FAE 389 

I i : : I : : I M : : : I 

Db 265 HISVSEPIIPFRETITKPPKVDMVNEEIGKQQKVAVIHQMKEDQSKIPEGIQVDSDGLIT 324 

Qy 390 — TPNKKNKITMIAEPLEKGLAE DI ENEWQITWN RKKLG 427 

Mil : : : | | I : : : | : : I : i : : : ! : 

Db 325 ITTPNKLATLSVRAMPLPEEVTQILEENSDLIRSMEQLTSSLNEGENTHMIHQKTQEKIW 384 

Qy 428 EFFQTKYDWDLLAAR SIWAFGPDATGPNILVD 459 

I I : I : I I 1 I : I M MINI: 

Db 385 E- FKGKLEQHLTGRRWRNI VDQIWS FGPRKCGPNI LVN 421 



RESULT 13 

US-09-8 15-2 42-52 38 

Sequence 5238, Application US/09815242 
Patent No. US2 002 0061569A1 
GENERAL INFORMATION: 
APPLICANT: Haselbeck, Robert 
APPLICANT: Ohlsen, Kari L . 
APPLICANT: Zyskind, Judith W. 
APPLICANT : Wall, Daniel 
APPLICANT: Trawick, John D. 
APPLICANT: Carr, Grant J. 
APPLICANT: Yamamoto, Robert T. 
APPLICANT: Xu, H. Howard 

TITLE OF INVENTION: Identification of Essential Genes in 
TITLE OF INVENTION: Prokaryotes 
FILE REFERENCE: ELITRA. 0 11A 

CURRENT APPLICATION NUMBER: US/ 09/ 8 15 , 242 
CURRENT FILING DATE: 2001-03-21 
PRIOR APPLICATION NUMBER: 60/191,078 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: 60/206,848 
PRIOR FILING DATE: 2000-05-23 
PRIOR APPLICATION NUMBER: 60/207,727 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: 60/242,578 
PRIOR FILING DATE: 2000-10-23 
PRIOR APPLICATION NUMBER: 60/253,625 
PRIOR FILING DATE: 2000-11-27 
PRIOR APPLICATION NUMBER: 60/257,931 
PRIOR FILING DATE: 2000-12-22 
PRIOR APPLICATION NUMBER: 60/269,308 
PRIOR FILING DATE: 2001-02-16 
NUMBER OF SEQ ID NOS : 14110 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 5238 
LENGTH: 693 
TYPE: PRT 

ORGANISM: Staphylococcus aureus 
US-09-815-242-5238 

Query Match 7.4%; Score 188; DB 9; Length 693; 

Best Local Similarity 22.8%; Pred. No. 1.8e-09; 

Matches 96; Conservative 70; Mismatches 161; Indels 94; Gaps 16 



Qy 77 SQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCKKFF 136 

: : | : i : I : : : : : I I : I : I I I I I I : I : 

Db 210 ARASLIEAVAETSDELMEKYLGDEEISV S ELKEAI RQATTNV- - E FY 254 

Qy 137 GEFTG FVDMCVQHI PSPKVGAKPKI EHTYTGGVDSDLGEAMSDCDPDGPL 186 

I : | : : : | | | : | | | I : : I : : i 

Db 255 P VL C GT AF KN K G VQ LMLD AVI DYLPSP-L D VK P 1 1 GH RAS N P E E EVIAKADDSAEF 309 

Qy 187 MCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISV 246 

j : I I I I ! I E I : : I II II I Mil 
Db 310 AALAFKVM-TDPYVGKLTFFRVYSGTMTSGSYVK NSTKGKRE RVGRLLQMH 359 

Qy 247 ARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEP 306 

1 | : : | : | : | : I I I : I : : I M -III 

Db 360 ANSRQEIDTVYSGDIAAAVGLKD TGTGDTLCGEKNDIILESMEF-PEPVIHLSVEP 414 

Qy 307 WPSELPKMLDGLRKVNKSYPSLTTKV-EESGEHVILGTGELYLDCVMHDLRKMYSEIDI 365 

: : : | | I I : : I : II : I : M I I I I : I I : : : : I : : : : 

Db 415 KSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVI IGGMGELHLDILVDRMKKEFN-VEC 473 

Qy 366 KVADPVVT FCETWET SSLKCFAET PNKKNKI TMI AEPLEKGLAEDI ENEWQI TWNRKK 425 

I MM: M II: I • = : M Ml 
Db 474 NVGAPMVSYRET FKSSAQVQGKFSR QSGGRGQYGDVHIE 512 

Qy 426 LGEFFQTKYDWDLLAARSIWAFGPDATG PNILVDDTLPSEVDKALLGSVKDSIVQ 480 

I I : I I I : I : I I : : MM: 

Db 513 FTPNETGAGFEFENAIVGGWPREYIPSVEAGLKDAMEN 551 

Qy 481 G 481 

! 

Db 552 G 552 



RESULT 14 

US- 09- 8 15-2 4 2-12 4 43 

Sequence 12443, Application US/09815242 
Patent No. US2002 0061569A1 
GENERAL INFORMATION: 
APPLICANT: Haselbeck, Robert 
APPLICANT: Ohlsen, Kari L. 
APPLICANT: Zyskind, Judith W. 
APPLICANT: Wall, Daniel 
APPLICANT: Trawick, John D. 
APPLICANT: Carr, Grant J. 
APPLICANT: Yamamoto, Robert T. 
APPLICANT: Xu, H. Howard 

TITLE OF INVENTION: Identification of Essential Genes in 
TITLE OF INVENTION: Prokaryotes 
FILE REFERENCE: ELITRA. 011A 

CURRENT APPLICATION NUMBER: US/09/815, 242 
CURRENT FILING DATE: 2001-03-21 
PRIOR APPLICATION NUMBER: 60/191,078 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: 60/206,848 
PRIOR FILING DATE: 2000-05-23 
PRIOR APPLICATION NUMBER: 60/207,727 



PRIOR FILING DATE: 2000-05-26 

PRIOR APPLICATION NUMBER: 60/242,578 

PRIOR FILING DATE: 2000-10-23 

PRIOR APPLICATION NUMBER: 60/253,625 

PRIOR FILING DATE: 2000-11-27 

PRIOR APPLICATION NUMBER: 60/257,931 

PRIOR FILING DATE: 2000-12-22 

; PRIOR APPLICATION NUMBER: 60/269,308 
PRIOR FILING DATE: 2001-02-16 

; NUMBER OF SEQ ID NOS : 14110 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 12443 

LENGTH: 715 
; TYPE: PRT 

; ORGANISM: Staphylococcus aureus 
US-09-8 15-242-12 4 43 



Query Match 7.4%; Score 188; DB 9; Length 715; 

Best Local Similarity 22.8%; Pred. No. 1.9e-09; 

Matches 96; Conservative 70; Mismatches 161; Indels 94; Gaps 16 

Qy 77 SQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCKKFF 136 

: : | : I : I : : : : : I I : I : E I I I E I : I : 

Db 232 ARASLI EAVAETS DELMEKYLGDEEI SV S ELKEAI RQATTNV — EFY 276 



137 GEFTG FVDMCVQHI PSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPL 186 

I : I : : : I I I : ! i I I : : I : : I 

277 PVLCGTAFKNKGVQLMLDAVIDYLPSP-LDVKPIIGHRASNPEE EVIAKADDSAEF 331 

187 MCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISV 24 6 

I : I I ! I I t I I : : I I I II I MM 

332 AALAFKVM-TDP YVGKLT FFRVYS GTMT S GS YVK NSTKGKRE RVGRLLQMH 381 



Qy 247 ARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEP 306 

| | : : | : | : | : II I : I : : i II —III 

Db 382 ANSRQEIDTVYSGDIAAAVGLKD TGTGDTLCGEKNDIILESMEF-PEPVIHLSVEP 436 

Qy 307 WPSELPKMLDGLRKVNKSYPSLTTKV-EESGEHVILGTGELYLDCVMHDLRKMYSEIDI 365 

: : : | | I I : : I : I I : I : : I E M I : I I : : : : I : : : : 

Db 437 KSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFN-VEC 495 

Qy 366 KVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKK 425 

t I : I : : I I I I : I : : : : I Ml 

Db 496 NVGAPMVSYRET FKSSAQVQGKFSR QSGGRGQYGDVHIE 534 

Qy 426 LGEFFQTKYDWDLLAARSIWAFGPDATG PNILVDDTLPSEVDKALLGSVKDSIVQ 480 

I I : IE I : I Ml : : MM: 

Db 535 FTPNETGAGFEFENAIVGGWPREYI PSVEAGLKDAMEN 573 

Qy 481G481 
1 

Db 574 G 574 



RESULT 15 

US-09-815-2 42-12 7 36 

; Sequence 12736, Application US/09815242 



Patent No. US2 002 00 61569A1 
GENERAL INFORMATION: 
APPLICANT: Haselbeck, Robert 
APPLICANT: Ohlsen, Kari L. 
APPLICANT: Zyskind, Judith W. 
APPLICANT: Wall, Daniel 
APPLICANT: Trawick, John D. 
APPLICANT: Carr, Grant J. 
APPLICANT: Yamamoto, Robert T. 
APPLICANT: Xu, H. Howard 

TITLE OF INVENTION: Identification of Essential Genes in 
TITLE OF INVENTION: Prokaryotes 
FILE REFERENCE: ELITRA. 01 1A 

CURRENT APPLICATION NUMBER: US/ 09/815, 242 
CURRENT FILING DATE: 2001-03-21 
PRIOR APPLICATION NUMBER: 60/191,078 
PRIOR FILING DATE: 2000-03-21 
PRIOR APPLICATION NUMBER: 60/206,848 
PRIOR FILING DATE: 2000-05-23 
PRIOR APPLICATION NUMBER: 60/207,727 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: 60/242,578 
PRIOR FILING DATE: 2000-10-23 
PRIOR APPLICATION NUMBER: 60/253,625 
PRIOR FILING DATE: 2000-11-27 
PRIOR APPLICATION NUMBER: 60/257,931 
PRIOR FILING DATE: 2000-12-22 
PRIOR APPLICATION NUMBER: 60/269,308 
PRIOR FILING DATE: 2001-02-16 
NUMBER OF SEQ ID NOS : 14110 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 12736 
LENGTH: 715 
TYPE: PRT 

ORGANISM: Staphylococcus aureus 
US- 09- 8 15-2 42-127 36 

Query Match 7.4%; Score 188; DB 9; Length 715; 

Best Local Similarity 22.8%; Pred. No. 1.9e-09; 

Matches 96; Conservative 70; Mismatches 161; Indels 94; Gaps 16 

Qy 77 SQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLVCKKFF 136 

: : | : t : I : : : : : M : I : Mil! I : I : 

Db 232 ARASLIEAVAETSDELMEKYLGDEEISV S ELKEAI RQATTNV — EFY 276 

Qy 137 GEFTG FVDMCVQHIPS PKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPL 186 

| : I : : : I It : I I I I : : I : : I 

Db 277 PVLCGTAFKNKGVQLMLDAVIDYLPSP-LDVKPIIGHRASNPEE EVIAKADDSAEF 331 

Qy 187 MCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISV 246 

I : I I I I I I 11 : : I I I II I MM 
Db 332 AALAFKVM-TDPYVGKLTFFRVYSGTMTSGSYVK NSTKGKRE RVGRLLQMH 381 

Qy 247 ARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEP 306 

| | : : | : | : I : II I : ! : : I II —Ml 

Db 382 ANS RQEI DTVYSGDIAAAVGLKD TGTGDTLCGEKNDIILESMEF-PEPVIHLSVEP 436 



Qy 307 VNPSELPKMLDGLRKVNKSYPSLTTKV-EESGEHVILGTGELYLDCVMHDLRKMYSEIDI 365 

: : : | | I I : : I : I I : I : : I I I I I : I I : : : : I : : : : 

Db 437 KSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFN-VEC 495 

Qy 366 KVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKK 425 

| I : | : : I [ I I : ! : : : : I 1=1 

Db 496 NVGAPMVSYRET FKSSAQVQGKFSR QSGGRGQYGDVHIE 534 

Qy 426 LGEFFQTKYDWDLLAARSIWAFGPDATG PNILVDDTLPSEVDKALLGSVKDSIVQ 480 

i t : I I I : I : I I : : : I I : : 

Db 535 FTPNETGAGFEFENAIVGGWPREYIPSVEAGLKDAMEN 573 

Qy 481 G 481 

I 

Db 574 G 574 



Search completed: January 30, 2004, 11:35:27 
Job time : 56.2763 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: January 30, 2004, 11:17:27 ; Search time 57.0837 Seconds 

(without alignments) 

2174.410 Million cell updates/sec 

Title : US- 09- 9 8 9-4 8 1-3 

Perfect score: 2527 

Sequence: 1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDSIVQG 481 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 23:* 



1 




sp archea:* 


2 




sp bacteria : * 


3 




sp fungi : * 


4 




sp human : * 


5 




sp invertebrate:* 


6 




sp mammal : * 


7 




sp mhc : * 


8 




sp organelle : * 


9 




sp phage : * 


10 


sp plant : * 


11 


sp rodent:* 


12 


sp virus : * 


13 


sp vertebrate:* 


14 


sp unclassified:* 


15 


: sp rvirus : * 


16 


: sp bacteriap:* 


17 


: sp archeap:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2491 


98 . 


6 


850 


4 


Q8IXJ3 


Q8ixj3 homo sapien 


2 


2409 


95. 


3 


974 


13 


Q8AVN2 


Q8avn2 xenopus lae 


3 


1848 


73. 


1 


975 


5 


Q 9 VAX 8 


Q9vax8 drosophila 


4 


1764.5 


69. 


8 


974 


5 


Q23463 


Q23463 caenorhabdi 


5 


1754.5 


69. 


4 


849 


5 


Q19070 


Ql 9 07 0 caenorhabdi 


6 


1528.5 


60. 


5 


987 


10 


Q9LNC5 


Q91nc5 arabidopsis 


7 


1335 


52 . 


8 


983 


3 


094316 


094316 schizosacch 


8 


1062 


42. 


0 


1235 


5 


Q8I JZ9 


Q8ijz9 Plasmodium 


9 


903 . 5 


35. 


8 


728 


5 


Q9BNW7 


Q9bnw7 scolopendra 


10 


881.5 


34 . 


9 


726 


5 


Q9BNX1 


Q9bnxl nipponopsal 


11 


872.5 


34 . 


5 


660 


5 


Q9BNW9 


Q9bnw9 polyxenus f 


12 


869 


34 . 


4 


858 


11 


Q8C153 


Q8cl53 mus musculu 


13 


867 


34 . 


3 


858 


11 


Q60423 


Q6042 3 cricetulus 


14 


863 


34 . 


2 


727 


5 


Q9BNW0 


Q9bnw0 peripatus s 


15 


8 62 . 5 


34. 


1 


844 


5 


Q95P39 


Q95p39 aedes aegyp 


16 


8 62 . 5 


34. 


1 


844 


5 


Q8T4R9 


Q8t4r9 aedes aegyp 


17 


862 


34 . 


1 


726 


5 


Q9BNX2 


Q9bnx2 mastigoproc 


18 


861 


34 . 


1 


702 


5 


Q9BNX6 


Q9bnx6 endeis laev 


19 


856.5 


33. 


9 


844 


5 


Q9BME7 


Q9bme7 aedes aegyp 


20 


855 


33. 


8 


858 


11 


Q8BMA8 


Q8bma8 mus musculu 


21 


854.5 


33. 


8 


844 


5 


Q8T4S0 


Q8t4s0 aedes aegyp 


22 


853 . 5 


33, 


8 


658 


5 


Q9BNX5 


Q9bnx5 hutchinsoni 


23 


851. 5 


33 


7 


633 


5 


Q9BNY0 


Q9bny0 artemia sal 


24 


851 


33 


7 


727 


5 


Q9BNW8 


Q9bnw8 scutigerell 


25 


844 


33 


4 


703 


5 


Q9BNW2 


Q9bnw2 milnesium t 


26 


839 


33 


2 


813 


3 


Q9P4S4 


Q9p4s4 Candida alb 


27 


838.5 


33 


2 


658 


5 


Q9BNX4 


Q9bnx4 limulus pol 


28 


838 . 5 


33 


2 


726 


5 


Q9BNX7 


Q9bnx7 eumesocampa 


29 


837.5 


33 


1 


658 


5 


Q9BNW5 


Q9bnw5 tomocerus s 


30 


836.5 


33 


1 


633 


5 


Q9BNX3 


Q9bnx3 machiloides 


31 


836 


33 


1 


841 


5 


Q95UT8 


Q95ut8 monosiga br 


32 


835 


33 


0 


813 


3 


Q9P4S0 


Q9p4s0 clavispora 


33 


830. 5 


32 


9 


637 


5 


Q9BNW6 


Q9bnw6 speleonecte 


34 


829 


32 


8 


813 


3 


Q9P4R9 


Q9p4r9 Candida par 


35 


828.5 


32 


. 8 


633 


5 


Q9BNX8 


Q9bnx8 semibalanus 


36 


826 


32 


.7 


813 


3 


Q9P4S1 


Q9p4sl Candida tro 


37 


816. 5 


32 


. 3 


656 


5 


Q9BNW1 


Q9bnwl nereis vire 


38 


816 


32 


. 3 


789 


10 


Q9M655 


Q9m655 euglena gra 


39 


808 


32 


. 0 


814 


3 


Q9P4S3 


Q9p4s3 Candida gla 


40 


804 


31 


. 8 


838 


3 


Q9HFZ8 


Q9hfz8 cryptococcu 


41 


803 


31 


. 8 


760 


5 


Q9NIH0 


Q9nih0 stylonychia 


42 


802 


31 


.7 


753 


5 


Q9NDT2 


Q9ndt2 Plasmodium 


43 


802 


31 


.7 


832 


5 


Q8IKW5 


Q8ikw5 Plasmodium 


44 


801.5 


31 


.7 


726 


5 


Q9BNW4 


Q9bnw4 tanystylum 


45 


793. 5 


31 


. 4 


665 


10 


Q8H145 


Q8hl45 arabidopsis 



ALIGNMENTS 



RESULT 1 
Q8IXJ3 

ID Q8IXJ3 PRELIMINARY 
AC Q8IXJ3; 

DT 01-MAR-2003 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 



PRT; 850 AA. 
23, Created) 

23, Last sequence update) 
23, Last annotation update) 



DE Small nuclear ribonucleoprotein component. 

GN SNRP116. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia ; Eutheria ; Primates ; Catarrhini ; Hominidae ; Homo . 

OX NCBI_TaxID=9 606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Lennerz V., Fatho M. , Gentilini C, Lifke A., Woelfel C, Woelfel T.; 

RT "Response of autologous T cells to a human melanoma is dominated by 

RT individual mutated antigens."; 

RL Submitted (AUG-2 002) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96051387; PubMed=7 5 84 02 6; 

RA Nomura N., Miyajima N., Sazuka T., Tanaka A. , Kawarabayasi Y. , 

RA Sato S., Nagase T . , Seki N., Ishikawa K. , Tabata S.; 

RT "Prediction of the coding sequences of unidentified human genes. I. 

RT The coding sequences of 40 new genes ( KIAA0001-KIAA004 0 ) deduced by 

RT analysis of randomly sampled cDNA clones from human immature myeloid 

RT cell line KG-1-"; 

RL DNA Res. 1:27-35(1994). 

DR EMBL; AJ505017; CAD43720.1; 

KW Nucleocapsid; Ribonucleoprotein. 

SQ SEQUENCE 850 AA; 95371 MW; D25DAEB8 1 93D25D8 CRC64; 

Query Match 98.6%; Score 2491; DB 4; Length 850; 

Best Local Similarity 99.4%; Pred. No. 4.5e-192; 

Matches 475; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNP 63 

I | | M i I I I I I I I I I I I I I 1 I I I I 1 I I 1 ! I I ! I I I I II I i MINIM 

YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNP 22 9 

KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 123 

M II M M I M M II II M II I M M M II I I M II I M II I I II II II II M I 

KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 289 

IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 183 

I II II I II II II M I I I M M I M 1 I M I M M M II II II Mill 

IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 34 9 

GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 2 43 

M II I 1 I I I M II I 1 II I I I II I II M II II II II I M II I I I II II I II I M I I I II 
GPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 4 09 

I SVARYHI EVNRVPAGNWVLI EGVDQPI VKTAT1TEPRGNEEAQI FRPLKFNTT S VI KI A 3 03 

I | || II II II I I II II II II II II II I I I II I II I I II II II II I M I II II M M I I I I 

ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 4 69 



Qy 


4 


Db 


170 


Qy 


64 


Db 


230 


Qy 


124 


Db 


290 


Qy 


184 


Db 


350 


Qy 


244 


Db 


410 


Qy 


304 


Db 


470 



II I I II I I M II II M I II II II II I I I I II I II I II II II I I II II II I I M I 



363 
529 



Qy 

Db 



364 DIKVADPVVTFCET VVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNR 423 

IE II I II II II I M II I I I I M II I M M II I II II I I II II I II II I I II I II I I I M I 
530 DIKV7VDPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNR 58 9 



Qy 424 KKLGEFFQTKYDWDLLTiARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 4 81 

II I I I I I I I I I M i I I I M I I I I I II I i I I I I I I I M I I I I I I ! I I I II I ! I M I i t I 

Db 590 KKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 647 



RESULT 2 
Q8AVN2 



ID Q8AVN2 PRELIMINARY; PRT; 974 AA. 

AC Q8AVN2; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to U5 snRNP-specif ic protein, 116 kD . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Embryo; 

RA Klein S., Strausberg R. ; 

RL Submitted (DEC-2 002) to the EMBL/GenBank/DDB J databases. 

DR EMBL; BC041724; AAH41724.1; -. 

SQ SEQUENCE 974 AA; 109777 MW; 35B07DCA5B23FE9D CRC64; 



Query Match 95.3%; Score 2409; DB 13; Length 974; 

Best Local Similarity 94.6%; Pred. No. 2.3e-185; 

Matches 452; Conservative 17; Mismatches 9; Indels 0; Gaps 



Qy 


4 


YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNP 


63 




I ! I 1 1 1 1 1 IE 1 1 1 1 I 1 M : 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 M 




Db 


294 


YSTDENLILSPLLGNVCFASSQYSICFTLGSFAKIYADTYGDINYQEFAKRLWGDI YFNP 


353 


Qy 


64 


KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 


123 




I 1 1 M I 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 : 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 : 1 1 : 1 I : 1 M M M 1 1 1 1 1 1 1 




Db 


354 


KTRKFTKKAPTS S SQRS FVEFVLEPLYKI LAQWGDVDTTLPQTLEELGI HLTKEELKLN 


413 


Qy 


124 


IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 


183 




I I I 1 : 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II M 1 II: 1 1 1 1 1 1 1 M : M : 1 1 1 1 1 : 1 1 1 1 




Db 


414 


IRPLLRLVCNRFFGEFTGFVDMCVQHIPSPKAGARAKIEHTYTGGIDSELGEVMSECDPD 


473 


Qy 


184 


GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 


243 




|||MIIIII:M Mill 1 1 1 : 1 M IE 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 : 1 1 : 1 M 1 




Db 


474 


GPLMCHTTKMYSTDDGVQFRAFGRLLSGTIHAGQ PVKVLGENYTLEDEEDSQVCTIGRLW 


533 


Qy 


244 


ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 


303 




: I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IE 1 1 I 1 ! 1 t 1 1 1 f 1 1 1 1 1 t 1 II 1 1 1 f 1 1 1 1 1 1 1 I 1 




Db 


534 


VSVARYHIEWRVPAGNWLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 


593 


Qy 


304 


VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 


363 




1 I I I IE 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IE 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 E 1 1 t 1 1 1 1 1 1 1 II 1 1 1 




Db 


594 


VEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 


653 


Qy 


364 


DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQITWNR 


423 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I : M I 



Db 654 DIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEVVQI SWNR 713 

Qy 424 KKLGEFFQTKYDWDLIAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I j | | ! I I I I M I I I i I I I I I I I I I M I I I I I I I M I ! M I I I I I I i I I I I 

Db 714 KKLGEFFQTKYDWDLLAARSIWAFGPDTTGPNILVDDTLPSEVDKALLSSVKDSIVQG 771 



RESULT 3 
Q 9 VAX 8 

ID Q 9 VAX 8 PRELIMINARY; PRT; 975 AA. 

AC Q 9 VAX 8 ; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE CG4849 protein (LD28793p) . 

GN CG4849. 

OS Drosophila melanogas ter (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BERKELEY ; 

RX ME DLINE= 20196006; PubMed= 10731132; 

RA Adams M.D., Celniker S.E., Holt R.A. , Evans C.A. , Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 

RA George R. A. , Lewis S.E., Richards S., Ashburner M., Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M., Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K., Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P. 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W. 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K . , 

RA Glodek A., Gong F. , Gorrell J.H., Gu Z . , Guan P., Harris M. , 

RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F., Karpen G.H., Ke Z . , Kennison J. A., Ketchum K.A. 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y. , Levitsky A. A. , Li J., Li Z., Liang Y., Lin X., 

RA Liu X., Mattel B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A. , Nixon K., Nusskern D.R., Pacleb J.M. , 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., 

RA Reinert K., Remington K. , Saunders R.D.C., Scheeler F . , Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E . , Spradling A.C., Stapleton M. , Strong R. , Sun E . , 

RA Svirskas R- , Tector C, Turner R. , Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A. , Weinstock G.M. , Weissenbach J., 



RA Williams S.M., Woodage T . , Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G. , Zhao Q. , Zheng L . , 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogas ter . " ; 

RL Science 2 87:2185-2195(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RA Stapleton M. , Brokstein P., Hong L., Agbayani A., Carlson J., 

RA Champe M. r Chavez C, Dorsett V., Dresnek D., Farfan D., Frise E . , 

RA George R. , Gonzalez M. , Guarin H,, Kronmiller B., Li P., Liao G. , 

RA Miranda A., Mungall C.J., Nunoo J., Pacleb J., Paragas V. , Park S. r 

RA Patel S., Phouanenavong S-, Wan K., Yu C, Lewis S.E., Rubin G.M., 

RA Celniker S. ; 

RL Submitted (MAR-2 002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AE003763; AAF56769.1; 

DR EMBL; AY089551; AAL90289.1; -. 

DR FlyBase; FBgn0039566; CG4849. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

KW GTP-binding; Protein biosynthesis. 

SQ SEQUENCE 975 AA; 110649 MW; 68A9072B3300E9CB CRC64; 

Query Match 73.1%; Score 1848; DB 5; Length 975; 

Best Local Similarity 72.4%; Pred. No. 4.3e-140; 

Matches 347; Conservative 57; Mismatches 75; Indels 0; Gaps 0; 

YYSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFN 62 
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Db 
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Qy 
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Db 
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Db 
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183 


Db 
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Qy 
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Db 
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I 1 I I E i ! I I I : i ! : : I E I I E I I : M I i II Ml : I : I I 



DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 2 42 

! I I 1 : : I t : , I I : : I I I : I I I I I : I i 1 I I I I I : I I I I I : I MM 

YGT LMVHSSKMYPNDDCTFFQVLARIVS GTLHAGQEVRVLGENYTLQDEEDSRILQVGRL 533 

WISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKI 3 02 
I : : t I : t : I I I I i I I I I I I E I : M I I I I M I : ■ I II II I I I I MM) 

WF 1 F..qRYK\/RT,NT5VPA^NWVT 1 TF.GTDOCIVKTSTIVDINVPEDLYIFRPLKFNTQSIIKI 5 93 



Qy 303 AVEPVNPSELPKMLDGLRKWKSYPSLTTWEESGEHVILGTGELYLDCVMHDLRKMYSE 362 

[ | | | M I I I I I I I I I I I I I I I I ! I I I : i : I I I I I I I ! I I I I I I ! I I 1 1 I I I I M M I I I 
Db 594 AVEPWPSELPKMLDGLRKWKSYPLLSTRVEESGEHVILGTGELYLDCVMHDLRKMYSE 653 

Qy 363 IDIKVADPVVTFCETWETSSLKCEAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

I | | | | M II I I I I I I I I I II I I M I I I I I I I I I I I I I : I M I I I I I 1 M I I I I II 
Db 654 IDIKVADPWAFCETWETSSLKCFAETPNKKNKITMISEPLEKGLAEDIENGTVCINWN 713 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

: j : : I I I I I I I I I I I I I I I I I I I I t ! : I I I I I I I I I I I I I I I I I II : I I I I I I I I 
Db 714 KKRI GEFFQVNYDWDLLAARS IWAFGPDSTGPNI LVDDTLPS EVDKNLLTAVKDS IVQG 7 72 



RESULT 4 
Q23463 

ID Q23463 PRELIMINARY; PRT; 974 AA. 

AC Q23463; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 {TrEMBLrel. 01, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Hypothetical 110.5 kDa protein. 

GN ZK32 8.2. 

OS Caenorhabditis elegans . 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=62 39; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bris tol N2 ; 

RX MEDLINE=99069613; PubMed=985 19 16 ; 

RA None; 

RT "Genome sequence of the nematode C. elegans: a platform for 

RT investigating biology. The C. elegans Sequencing Consortium."; 

RL Science 282:2012-2018(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bris tol N2 ; 

RA Favello A. ; 

RT "The sequence of C. elegans cosmid ZK328."; 

RL Submitted (MAR-1996) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bris tol N2 ; 

RA Waters ton R. ; 

RT "Direct Submission."; 

RL Submitted (JUN-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; U50193; AAA91248.1; -. 

DR WormPep; ZK328.2; CE05066. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 



KW Hypothetical protein; GTP-binding; Protein biosynthesis. 
SQ SEQUENCE 974 AA; 110484 MW; 44C4A5FDA61F0 13C CRC64; 



Query Match 69.8%; Score 1764.5; DB 5; Length 974; 

Best Local Similarity 69.1%; Pred. No. 2.3e-133; 

Matches 331; Conservative 66; Mismatches 81; Indels 1; Gaps 1 

Qy 4 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGD-INYQEFAKRLWGDI YFN 62 

: : : : :\\\\ III Ml : I : : I I : I I I : I I I IE I : I I I : I I I I I I I I 

Db 295 FAEEDVPVLSPLNGNVIFSSGRYNVCFSLLSFSNI YAKQHGDSFNSKEFARRLWGDI YFE 354 

Qy 63 PKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

MEM I ! : I : : I : ! I : I I I I I : I I I : I I I I I I I I II : Mil MM! I : 
Db 355 KKTRKFVKKSPSHDAPRTFVQFILEPMYKIFSQWGDVDTCLPDVMAELGIRLSKEEQKM 414 

Qy 123 NI RPLLRLVCKKFFGEFTGFVDMCVQHI P S PKVGAKPKI EHT YTGGVDS DLGEAMSDCDP 182 

I : | I I : I : I | : I M M : M I : I M I IE M I I I M I I I I : I M 
Db 415 NVRPLIALICKRFFGDFSAFVDLWQNIKSPLENAKTKIEQTYLGPADSQLAQEMQKCNA 474 

Qy 183 DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 242 

Mill t I M : 1 Ml i II ! : I II : I I M II I M : : M I II : Mill 
Db 475 EGPLMVHTTKNYPVDDATQFHVFGRVMSGTLEANTDVRVLGENYSIQDEEDCRRMTVGRL 534 

Qy 243 W I S VARYH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E EAQ I FRP L K FN T T S VI K I 302 

: : M I II 1 : I I I I I II I II M II II I I II II I I = II II II M I E : I : 

Db 535 FVRVAS YQIEVSRVPAGCWVLIEGIDQPIVKTATIAELGYEEDVYIFRPLKFNTRSCVKL 594 

Qy 303 AVEPWPSELPmLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 362 

El I I : II ! II I El f I I i I I I I I I I I I I I M I II I I I I M I I ! I I M II I I I M I : M 1 
Db 595 AVEPINPSELPKMLDGLRKVNKSYPLLTTRVEESGEHVLLGTGEFYMDCVMHDMRKVFSE 654 

Qy 363 IDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

I I I I I I I I I II I I I I : II M I I I ! I I II I I I II I II M I M I I I II II I II II I II 
Db 655 IDIKVADPWT FNETVIETSTLKCFAETPNKKNKITMMAEPLEKQLDEDIENEWQI GWN 714 

Qy 423 RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I : : | II M I II M I II II II II I 11 I M I I I M M I I II I M II I II M : M M I I 
Db 715 RRRLGEFFQTKYNWDLLAARS IWAFGPDTTGPNI LLDDTLP S EVDKHLLSTVRESLVQG 7 73 



RESULT 5 


Q19070 


ID 


Q19070 PRELIMINARY; PRT; 849 AA. 


AC 


Q19070; 


DT 


01-NOV-1996 (TrEMBLrel. 01, Created) 


DT 


01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 


DT 


01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 


DE 


Eft-1 protein. 


GN 


EFT-1. 


OS 


Caenorhabditis elegans . 


OC 


Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 


oc 


Rhabditidae; Peloderinae; Caenorhabditis. 


ox 


NCBI TaxID=6239; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RC 


STRAIN=Bristol N2 ; 


RX 


MEDLINE=92 153310; PubMed=1739435 ; 



RA Ofulue E.N., Candido E.P.; 

RT "Isolation and characterization of eft-1, an elongation factor 2-like 

RT gene on chromosome III of Caenorhabditis elegans."; 

RL DNA Cell Biol. 11:71-82(1992). 

DR EMBL; M86958; AAA21824.1; 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; I PRO 007 95; EF_GTPbind. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PRO 0315; ELONGATNFCT . 

KW GTP-binding; Protein biosynthesis. 

SQ SEQUENCE 849 AA; 96233 MW; 2FF8B4C95BD4 87 6B CRC64; 

Query Match 69.4%; Score 1754.5; DB 5; Length 849; 

Best Local Similarity 68.9%; Pred. No. 1.2e-132; 

Matches 330; Conservative 66; Mismatches 82; Indels 1; Gaps 1; 

YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGD— INYQEFAKRLWGDI YFN 62 

: : : : : I I II lit III : I : : I I : I II: EM II I : I I I : I I I I I I I I 
FAEEDVPVLSPLNGNVIFSSGRYNVCFSLLSFSNIYAKQHGDSFNSKEFARRLWGDIYFE 22 9 

PKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKL 122 

| I M I I I : I : : I = M : I 11 M : I I I : I I I I I I I I II : MM Mill M 
KKTRKFVKKSPSHDAP RTFVQFILEPMYKIFSQVVGDVDTCLPDVMAELGIRLSKEEQKM 28 9 

NIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDP 182 

I : | I M I : I M I I M I : I M : I I : I I I II I I I I I I 111 = 1 M 
NVRPLIALICKRFFGDFSAFVDLWQNIKSPLENAKTKIEQTYLGPADSQLAQEMQKCNA 34 9 

DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 242 

Mill I I II : I I I I I M I M I M I I M II I I I : : M I I I : Mill 
EGPLMVHTTKNYPVDDATQFHVFGRVMSGTLEANTDVRVLGENYSIQDEEDCRRMTVGRL 409 
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Db 
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I II 1 M I I It II I I 1 1 M I II II II I I I M II I I I II I I 



AVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 362 

I | | 1 : I I M I I I II I I M I I I I I II I M : It i I I 1 I E : I I I I t I M I I II I M I : M t 
AVEPINPSELPKMLDGLRKVNKS YPLLTTRVEESGEHVLLGTGEFYMDCVMHDMRKVFSE 52 9 

IDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 422 

M II I I I 1 I I M t II M M M II I II II II II It 1 I M II I I I I I M 1 I I II II II 

IDIKVADPWTFNETVIETSTLKCFAETPNKKNKITMMAEPLEKQLDEDIENEWQIGWN 58 9 
RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I : M I I II II I I M I I I M I M II I I I M I It I M II 1 I I I I I I i I M : M M II 

RRRLGEFFQTKYNWDLLAARSIWAFGLDTTGPNILLDDTLPSEVDKHLLSTVRESLVQG 64 8 



RESULT 6 
Q9LNC5 

ID Q9LNC5 PRELIMINARY; PRT; 987 AA. 



AC Q9LNC5; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE F9P14.8 protein. 

GN F9P14.8. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta ; Magnoliophyta ; eudicotyledons ; core eudicots; Rosidae; 

OC eurosids II; Brassicales; Brassicaceae ; Arabidopsis. 

OX NCBI_TaxID=3702; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia; 

RA Sakano H., Vaysberg M., Lee J.M. , Lenz C, Liu S.X., Pham P., 

RA Toriumi M. , Yu G . , Chin C, Chiou J . , Choi E., Chung M. , Gonzalez A., 

RA Howng B., Liu A., Altafi H., Brooks S., Buehler E . , Chao Q. , Conn L . , 

RA Conway A.B., Hansen N.F., Johnson-Hopson C, Khan S., Kim C, Lam B., 

RA Miranda M. , Nguyen M. , Palm C.J., Shinn P., Southwick A., Davis R.W., 

RA Ecker J.R., Federspiel N.A. , Theologis A.; 

RT "The sequence of BAC F9P14 from Arabidopsis thaliana chromosome 1 . " ; 

RL Submitted (JUN-2000) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AC025290; AAF80219.1; -. 

DR InterPro; IPR002048; EF-hand. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF__GTPbind. 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR0 0231; small_GTP; 1. 

DR PROSITE; PS00018; EF_HAND; 1. 

KW GTP-binding; Protein biosynthesis. 

SQ SEQUENCE 987 AA; 110441 MW; 8 68DD5F6597FBB7 9 CRC64; 

Query Match 60.5%; Score 1528.5; DB 10; Length 987; 

Best Local Similarity 61.2%; Pred. No. 2.6e-114; 

Matches 289; Conservative 69; Mismatches 113; Indels 1; Gaps 

iSPLLGNVCFSSSQYSICFTLGSFAKI YADTFG-DINYQEFAKRLWGDI YFNPKTRKFT 
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Db 
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Qy 
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FRTKYDWDLLAARSIWAFGPDKQGPNILLDDTLPTEVDRNLMMAVKDSIVQG 783 




Db 


732 





RESULT 7 
094316 

ID 094316 PRELIMINARY ; PRT; 983 AA. 

AC 094316; 

DT 01-MAY-1999 ( TrEMBLrel . 10, Created) 

DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to Human U5 snRNP-specif ic ribosomal translocase EF-2 . 

GN SPBC215.12. 

OS Schizosaccharomyces pombe (Fission yeast) . 

OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes ; 

OC Schizosaccharomycetales ; Schizosaccharomycetaceae ; 

OC Schizosaccharomyces . 

OX NCBI_TaxID=48 96; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=972h-; 

RA Lyne M. , Rajandream M.A. , Barrell B.G., Rieger M; 

RL Submitted (NOV-1998) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AL033534; CAA22126.1; 

DR GeneDBJPombe; SPBC215.12; -. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR005225; Small__GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

KW GTP-binding; Protein biosynthesis. 

SQ SEQUENCE 983 AA; 111174 MW; 107BBE37 50B30 9 84 CRC64; 

Query Match 52.8%; Score 1335; DB 3; Length 983; 

Best Local Similarity 53.7%; Pred. No. l.le-98; 

Matches 257; Conservative 74; Mismatches 146; Indels 2; Gaps 



Qy 


5 


STDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPK 

| I : | j | 1 | | | | : 1 1 1 1 1 II I 1 : 1 1 11= 1 : I 1 E M 1 1 I 1 : 1 

SKDLKYRVSPELGNVCFASCDLGYCFTLSSFAKLYIDRHGGIDVDLFSKRLWGDI YFDSK 


64 


Db 


304 


363 


Qy 


65 


TRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNI 

MM!:: j M 1 1 1 1 M 1 1 I 1 : - 1 1 : I 1 : 1 : : t : 

TRKFAKQSLDGSGVRSFVHFILEPLYKLHTLTI SDEAEKLKKHLSSFQI YLKPKDYLLDP 


124 


Db 


364 


423 


Qy 


125 


RPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDC — DP 

: M 1 : 1 : 1 III llh : 1 1 1 1 1 : 1 1 : | | : M : 1 : 1 : : : 

KPLLQLICASFFGFPVGFVNAVTRHIPSPRENAARKASQSYIGPINSSIGKAILEMSREE 


182 


Db 


424 


483 


Qy 


183 


DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRL 

M : i 1 1 : : : 1 I |: Mill: I 1 I M U 1 1 1 : 1 M ! 1 i : : : 

SAP LVMHVT KL YNT VDANNFYAFARVYS GQVKKGQKVKVLGEN YS LEDEEDMVVAH I AEI 


242 


Db 


484 


543 


Qy 


243 


W I S VAR YH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E E AQ I F R P L K E'NT T S VI K I 
: M | : | : I I 1 1 : 1 1 1 1 1 1 1 1 1 MM: : I I I : 
CVPCARYRLHVDGAVAGMLVLLGGVDNSI SKTATIVS DNLKDDPYI FRPIAHMSESVFKV 


302 


Db 


544 


603 


Qy 


303 


AVE PVNPSELPKMLDGLRKVNKS YPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSE 

iiii i i i i i i i iiiiii iiiii i i i i t i I I I 1 1111*1*11***111 MM 
MM 11 1 11 II M II 1 M II 1 I 1 1 1 1 1 M 1 1 1 1 I I I I - I * it •■■ i i i -iii 

AVEPHNPSELPKLLDGLRKTNKSYPLSITKVEESGEHTIFGTGEMYMDCLLYDLRTLYSE 


362 


Db 


604 


663 


Qy 


363 


IDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWN 

1 M : 1 : II 1 IIII I : 1 II : II 1 :M 1 II M : II 1 : MINI:: MM ! 1 1 
IEIRVSDPVARFCETAVDTSSIKCFSDTPNKKNRITMWEPLEKGISNDIENGKVNINWP 


422 


Db 


664 


723 


Qy 


423 


RKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 

M : : MM 1 II 1 1 I 1 M 1 M 1 M 1 1 1 1 1 : : M 1 : 1 1 M : 1 1 1 

QKRI S EFFQKN YDWDLLAS RS I WAFGP DDRGTN I LRDDT L ST DVDKNVLN SVKEYIKQG 


481 


Db 


724 


782 



RESULT 8 
Q8IJZ9 

ID Q8IJZ9 PRELIMINARY; PRT; 1235 AA. 

AC Q8IJZ9; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE U5 small nuclear ribonuclear protein, putative. 

GN PF10_0041. 

OS Plasmodium falciparum (isolate 3D7) . 

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida ; Plasmodium. 

OX NCBIJTaxID=36329; 
RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=3D7; 

RX MEDLINE=22255705; PubMed=12368 864 ; 

RA Gardner M.J., Hall N., Fung E., White O., Berriman M. , Hyman R.W., 

RA Carlton J.M., Pain A., Nelson K.E., Bowman S., Paulsen I.T., James K., 

RA Eisen J.A., Rutherford K., Salzberg S.L., Craig A., Kyes S., 

RA Chan M.-S., Nene V., Shallom S.J., Sun B., Peterson J., Angiuoli S., 

RA Pertea M. , Allen J., Selengut J., Haft D., Mather M.W., Vaidya A.B., 

RA Martin D.M.A. , Fairlamb A.H., Fraunholz M.J., Roos D.S., Ralph S.A., 

RA McFadden G.I., Cumraings L.M., Subramanian G.M., Mungall C, 



RA Venter J.C., Carucci D.J., Hoffman S.L., Newbold C, Davis R.W., 

RA Fraser CM., Barrell B.; 

RT "Genome sequence of the human malaria parasite Plasmodium 

RT falciparum. "; 

RL Nature 419:4 98-511(2002). 

DR EMBL; AE014829; AAN35239.1; -. 

KW Nuclear protein. 

SQ SEQUENCE 1235 AA; 142943 MW; EEE0EE55 92AFEB6E CRC64 ; 

Query Match 42.0%; Score 1062; DB 5; Length 1235; 

Best Local Similarity 38.3%; Pred. No. 1.6e-76; 

Matches 235; Conservative 80; Mismatches 159; Indels 140; Gaps 7; 

Qy 8 ENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFG — DINYQEFAKRLWGDI YFNPKT 65 

: : : | M I I I I I I I I : I I I I I : I I I : • : 1 : > > : MINI::: 
Db 422 KDFLLSPLKNNVLFSSSIYGVFFTLKSFSKI YCNIYNAYNIDIDEFSLYLWGDIYYDEEN 481 

Qy 66 RKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIR 12 5 

|| : : : I : I I I M I M I I I I : I I III: I : 

Db 482 FTFVKSPLYANQKRT FVEFILNPLYKI FGYVCSEEKEFLI PFLQSFNI SLKKSDYLYNTK 541 

Qy 126 PLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDC DP 182 

||:: | : I I I I : : : I M III I : I : : : I I I 

Db 542 YLLKKINGMIFQDTTAFVDIIIDNCPSPLDNAKNKTLQI YSGSLKTKISYDMMKCMKGDE 601 

Qy 18 3 DGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQ1CTVGRL 242 

||: | : : I I I I : I II I I I :: I I I I I I I : I I : I 

Db 602 TDNLMI YIIKNYHRPECGMIDLFGRVMCGTIKKGQSVRILGEGYTLNDDEDMITRVITHL 661 

Qy 243 W I S VARYH I EVNRVP AGNWVL I EGVDQ P I VKT AT I T E P R 281 

|| I I ! I I : : I I I I : I I 1 : I I I I I 1 I = 
Db 662 WI YEGRYRI EVDEI PAGNFCLI GGI DI CINKTCT ITNVKI KTNKNKEDHVNLNWYDDLND 721 



QY 



282 GNEEAQI FRPL — KFN TTSVIKIAVEPVNPSE 311 

M I : I I I I II 111:111:1111 



Db 722 DDNNIVKYKNDKDIYNLDKSALLNDNENAEIFYPLHKKFRYLNCVNSVFKVACEPINPSE 7 81 

Qy 312 LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 371 

| | | t 1 : I I II : : I II : I I I I I I I I I : I I I I I I I I I I I :: I I I I I : I = = = I M : I I I 
Db 782 LPKMLEGLRKIDKVYPLSSTKVEESGEHIILGTGELYLDCILHDLRKLYGDLEIKVSDPV 841 

Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRK-KLGEFF 4 30 

| | M :: I II : I I I I ! I I I I I I I I I 1 I I I :: I I : 1 1 : I : = : : ' : 
Db 842 VQFNETIIETSALNCFAETPNKKNKIYMIAEPVQKELGDDIVQGLVHLNEDQNLNVNEYI 901 

432 

Qy 431 QT ^ Z 

Db 902 STMDRILNKNNESKRNI YDEKDDSHDDNDDGDNMENHMNGIENKKNIFKDKEQKDEHQIN 9 61 

433 KYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVD 467 

1 | : I i : I : I I : I 1 1 I t : I I I I I I I : I I : 
Db 962 DLDKEKRKSTLNYNIDPNVISLLKNKYNWDILSIRSLWAFGPENNSPNILVDDSLFQETN 1021 



Qy 



Qy 

Db 



4 68 KALLGSVKDSIVQG 481 
I 1 1 : I I : I : I I 
1022 KENLYSIKDNIIQG 1035 



RESULT 9 
Q9BNW7 

ID Q9BNW7 PRELIMINARY; PRT; 728 AA. 

AC Q9BNW7; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Elongation factor-2 (Fragment). 

OS Scolopendra polymorpha. 

OC Eukaryota; Metazoa; Arthropoda; Myriapoda; Chilopoda; 

OC Pleuros tigmophora ; Scolopendromorpha ; Scolopendridae ; Scolopendra . 

OX NCBI_TaxID=109757; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 1317 0 60; PubMed= 11421654; 

RA Regier J.C., Shultz J.W. ; 

RT "Elongation factor-2: a useful gene for arthropod phylogenetics . " ; 

RL Mol. Phylogenet. Evol . 20:136-14 8(2 001). 

DR EMBL; AF240828; AAK12353.1; -. 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EFjSTPbind. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTPJ3FTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR PROSITE; PS00301; EFACTORJGTP; 1. 

KW GTP-binding; Protein biosynthesis. 

FT NON_TER 1 1 

FT NON TER 72 8 728 

SQ SEQUENCE 728 AA; 81821 MW; 69FB99C6A82 0E8B9 CRC64; 



Query Match 



35.8%; Score 903.5; DB 5; Length 728; 

Best" Local Similarity 39.7%; Pred. No. 4.6e-64; 
Matches 187; Conservative 99; Mismatches 176; Indels 9; Gaps 4; 



Q 12 LSpLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTK- 7 0 

: | | : ! | | : I I I I : : I I I : ! I : : : 11111= : : | | I : : I : I 
Db 201 VDPSKGSVGFGSGLHGWAFTLKQFSEI YAEKF-KIDVEKLMKRLWGENFYNPKSKKWAKS 259 

Q 71 KAPTSSSQRSFVEFILEPLYKILAQVVGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 130 

I i:|:M: := :|: l = M lit: = : ll=: 

Db 260 NDETGEFKRSFCMFILDPIYKVFDAIMNYKTDEIPKLLEKLNIILKGEDKDKDGKSLLKV 319 

Qv 13 i VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 190 

Y [ : : : : | I : I 1 I I : : I I I I : I : M M I I I I : 

Db 320 VMRQWLPAGEALLQMIAIHLPSPVKAQKYRMEMLYEGPHDDEAAVAVKNCDPSGPLMMYV 37 9 

Qy 191 TKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 250 

:|| II :|: MM II : M Ml Ml : I = = M ^ 

Db 380 SKMVPTSDKGRFYAFGRVFSGWSTGQKVRIMGPNYTPGKKEDLYEKAIQRTILMMGRYT 439 

Qv 251 IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQI FRPLKFNTTSVI KIAVEPVNPS 310 

: l| |1 : | M I M M M I : M | : | | : : ! : : : I I II I I I 

Db 440 EAIEDVPCGNICGLVGVDQFLVKTGTITT FKDAHNMRVMKFSVS PWRVAVE P KN P S 496 



Qy 311 ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 370 

: | | | : : : | | : : : II I : : i I I I I I : I 1 I I I : I : : I I = = = ' I I I : I I 

Db 497 DLPKLVEGLKRLAKSDPMVQCI IEESGEHIIAGAGELHLEICLKDLEEDHACIPIKVSDP 556 

Qy 371 VVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 430 

1 | : : ! I I I I : I I : : M I ! : : I I : i : : ! I I i I : I : ' : 

D b 557 WS YRETVSEESEIMCLAKS PNKHNRLFMKAQPMPEGLPEDI DKGDVTARDDFKARARYL 616 

Qy 431 QTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I 1 I : I : I I I I i i i 1 I I I I If : I I : I : I I I : I I 

D b 617 SDKYDYDVTEARKIWCFGPDGTGPNILIDCTKGVQ YLNEIKDSWAG 663 



RESULT 10 
Q9BNX1 

ID Q9BNX1 PRELIMINARY; PRT; 726 AA. 

AC Q9BNX1; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel . 17, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Elongation factor-2 (Fragment) . 

OS Nipponopsalis abei . 

OC Eukaryota; Metazoa; Arthropoda; Chelicerata; Arachnida; Opiliones; 

OC Palpatores; Troguloidea ; Nipponopsalididae ; Nipponopsalis. 

OX NCBI_TaxID=109753; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 1317 0 60; PubMed=1142 1 654 ; 

RA Regier J.C., Shultz J.W. ; 

RT "Elongation factor-2: a useful gene for arthropod phylogenetics . " ; 

RL Mol. Phylogenet. Evol . 2 0:136-148(2001). 

DR EMBL; AF240824; AAK12349.1; 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU__D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR005225; Small_GTP . 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF0000 9; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW GTP-binding; Protein biosynthesis. 

FT NON_TER 1 1 

FT NON_TER 72 6 726 

SQ SEQUENCE 726 AA; 81041 MW; 8EAADFAFDD2C3DED CRC64; 

Query Match 34.9%; Score 881.5; DB 5; Length 726; 

Best Local Similarity 37.9%; Pred. No. 2.7e-62; 

Matches 178; Conservative 108; Mismatches 175; Indels 9; Gaps 4 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKT RKFTKK 71 

: | | : | | I : I ill I : : I I I : I I : : : i M I : : : | I : : : I : : I I 

Db 201 VDPSKGSVGFGSGLHSWAFTLKQFSEIYAEKF-KIDVEKLMNRLWGENFYNPQSKKWSKK 259 



Qy 72 APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 131 

: I : 1 I : I : f : I I I : : I : I : : I I I : : : : : | | : : j 

Db 260 M-DEGFKRAFCMFVLDPIYKI FKAIMGYQKEETAKLLEKLNIILKGDDKEKDGKNLLKVV 318 

Qy 132 CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 191 

: : : I I : I M : : I : II I : II I M I I I I I : : 

Db 319 MRNWLPAGDALLQMIAIHLPSPVTAQRYRIDLLYEGPQDDEAAVAMKSCDPDGPLMMYI S 378 

Qy 192 KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 251 

II II : I : I I I I I I : : I I I : : : I I I : : I : I : : I 

Db 37 9 KMVPTSDKGRFYAFGRVFSGIVSSGQKVRIMGPNYLPGKKDDLAEKAIQRTVLMMGRAVE 43 8 

Qy 252 EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 311 

: I I : I I : INI : I I I I I : : : I I : I I : : I : : : I I I I : i I I : 

Db 439 P I EN VP S GN I C GLVGVDQ FLVKT GT 1ST FKDAHNMRVMKFS VS P WRVAVE PMN PSD 495 

Qy 312 LPKMLDGLRKA^KSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKV7U)PV 371 

I I I : : : I I : : : I I I : : I I I I I i : : I I I I : I : : | | : : : | : | IN 
Db 496 LPKLVEGLKRLAKSDPMVQCIIEESGEHIVAGAGELHLEICLKDLEEDHAGI PLKKTDPV 555 

Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQ 431 

I I : I I I I I : : I : : : I I I I : : I I I : : I I I M : : I : I I : 

Db 556 VTYRETVAEESAIMCLSKSPNKHNRLYMKATPMQDGLPEDIDSGAVNPKDDFKARGRYLS 615 

Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I : I I M IE I M : I I I I : I I I I : I : II I : I I 

Db 616 DKYEWDATEARKIWCFGPEGTGPNLLVDVTKGVQ YLNEIKDSWAG 661 



RESULT 11 
Q9BNW9 

ID Q9BNW9 PRELIMINARY; PRT; 660 AA. 

AC Q9BNW9; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Elongation factor-2 (Fragment) . 

OS Polyxenus f asciculatus . 

OC Eukaryota; Metazoa; Arthropoda; Myriapoda; Diplopoda; Penicillata; 

OC Polyxenida; Polyxenidae; Polyxenus. 

OX NCBI_TaxID=5 8 78 6; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE-21317 0 60; PubMed^l 142 1654 ; 

RA Regier J.C., Shultz J.W.; 

RT "Elongation factor-2: a useful gene for arthropod phylogenetics . " ; 

RL Mol. Phylogenet. Evol. 2 0:136-148(2001). 

DR EMBL ; AF240826; AAK12351.1; -. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP__EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW GTP-binding; Protein biosynthesis. 



FT NON_TER 1 1 

FT NON_TER 660 660 

SQ SEQUENCE 660 AA; 73957 MW; 303B78 91D8656E8C CRC64; 

Query Match 34.5%; Score 872.5; DB 5; Length 660; 

Best Local Similarity 38.2%; Pred. No. 1.2e-61; 

Matches 179; Conservative 103; Mismatches 177; Indels 9; Gaps 4 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 

: I I : I I i : III I : : : I I : I I I : : : I I 1 I : : : I I I : : I : I 
Db 2 01 VDPSKGSVGFGSGLHGWAFTLKQFSELYAEKFG-IDVDKLMRRLWGENFYNPKSKKWAKS 259 

Qy 72 APTSSS-QRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 130 

: : I I I I : I : I : II : : : : I : I : : I I I I : : : : I I : 

Db 2 60 SNEGPDFKRSFCMFVLDPI YKVFDAIMNYKTEEIPKLLEKLNIVLKGEDKEKDGKLLLKT 319 

Qy 131 VCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHT 190 

I : : : : I hill I : : I II I : : : I I I : I M I : 

Db 32 0 VMRQWLPAGEALLQMITIHLPSPVTAQKYRMELLYEGPHDDEAALGIKNCDPNGPLMMYV 37 9 

Qy 191 TKMFSTHDGVQ FHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYH 250 

: I I II : I : I 1 I I I I : I I I : : : I II I : I I : I : : I I 

Db 38 0 SKMVPTTDKGRFYAFGRVFSGIVSTGQKVRIMGPNYTPGKKEDLYEKAIQRTILMMGRYT 439 

Qy 251 IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQI FRPLKFNTTSVTKIAVEPVNPS 310 

: I I I I : I I I I : I I I I I : : : I I : I I : : I : : : I I I III 

Db 440 EAI E EVP CGN I CGLVGVDQ FL VKT GT 1ST FKDAHNLRVMKFSVSPWRVAVEAKNPS 496 

Qy 311 ELPKMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 370 

: I I I : : : I I : : : II I : : I I I I I I : I I I I I : I : : II : : : I M I : II 
Db 497 DLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHACIPIKVSDP 556 

Qy 371 WTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 430 

I I : : I I I I I : I I : : I I I I : : I I I : : I I II I : I : I I 

Db 557 VVS YRETVTEESDIMCLAKS PNKHNRLYMKAVPMPEGLPEDI DKGEVTAJRDDFKSRGRLL 616 

Qy 431 QTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP SEVDKALLGSVKDS I 478 

I I : : I : I I I I I I I I II I I I I : I I : I : I I I : 

Db 617 AEKYEYDITEARKIWCFGPDGTGPNILIDCTKGVQ YLNEIKDSV 660 



RESULT 12 
Q8C153 

ID Q8C153 PRELIMINARY; PRT; 858 AA. 

AC Q8C153; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Elongation factor 2. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10 090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Skin; 

RX MEDLINE=22354 683; PubMed=124 668 51 ; 



RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse trans criptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK02 8 93 8; BAC2 62 03.1; -. 

SQ SEQUENCE 858 AA; 95258 MW; 3A33 9CD9 84F58AD0 CRC64; 

Query Match 34.4%; Score 869; DB 11; Length 858; 

Best Local Similarity 37.7%; Pred. No. 3.5e-61; 

Matches 184; Conservative 93; Mismatches 189; Indels 22; Gaps 5 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTF GDINYQEFA KRLW 56 

i : : : I : I I I I I : ill I I : : I I I : : II I : I I 

Db 202 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLSAAERAKKVEDMMKKLW 2 61 

Qy 57 GDIYFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I I I I : I II : I I : : I : I : I I : I : : I : : : : : : : I I 

Db 262 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 321 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I I : : | | | : | : : : : | | : | | | I : I I I I : 

Db 322 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 3 81 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I I I I I I I : : I I II : I : I 1 I I I I : I I : : : I I I I : I I 
Db 382 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 441 

Qy 234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: : I : : I I : I I I I I : I I II : I I I I I I II I : 1 

Db 442 L YL K P I Q RT I LMMGR YVE PIE D VP C GN I VGL VGVD Q F L VKT GT ITT FEHAHNMRVMK 498 

Qy 294 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I I I I I : : I I I : : : M : : : II I : : I I I I I I : I I M I : I : : 
Db 499 FSVSPWRVAVELAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 558 

Qy 354 HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II : : : I II : M I I : : III I I : : I : : : I I I I : : ! I I MINI: 

Db 559 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 618 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 4 73 

I I : I I : I I : I I I I I I M I I M I I I I : I 

Db 619 KGEVSAI^QELKAJ^RYIAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 674 

Qy 474 VKDSIVQG 4 81 

: I I I : I I 
Db 675 IKDSWAG 682 



RESULT 13 
Q60423 

ID Q60423 PRELIMINARY 
AC Q60423; 

DT 01-NOV-1996 (TrEMBLrel. 
DT 01-NOV-1996 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 
DE Elongation factor 2. 



PRT; 858 AA. 
01, Created) 

01, Last sequence update) 
23, Last annotation update) 



GN EF2 . 

OS Cricetulus griseus (Chinese hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Cricetinae; 

OC Cricetulus. 

OX NCBI_TaxID=10029; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Ovary; 

RX MEDLINE=96032765; PubMed=7 55947 0 ; 

RA Foley B.T., Moehring J.M. , Moehring T.J.; 

RT "Mutations in the elongation factor 2 gene which confer resistance to 

RT diphtheria toxin and Pseudomonas exotoxin A. Genetic and biochemical 

RT analyses . " ; 

RL J . Biol. Chem. 270:23218-23225(1995). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=86259716; PubMed=3014523 ; 

RA Kohno K. , Uchida T . , Ohkubo H., Nakanishi S., Nakanishi T., Fukui T., 

RA Ohtsuka E., Ikehara M. , Okada Y. ; 

RT "Amino acid sequence of mammalian elongation factor 2 deduced from the 

RT cDNA sequence: homology with GTP-binding proteins."; 

RL Proc. Natl. Acad. Sci. U.S.A. 83:4978-4982(1986). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TI SSUE=Ovary ; 

RX MEDLINE=88198187 ; PubMed=2 834376; 

RA Nakanishi T., Kohno K., Ishiura M. , Ohashi H., Uchida T.; 

RT "Complete nucleotide sequence and characterization of the 5 1 -flanking 

RT region of mammalian elongation factor 2 gene."; 

RL J. Biol. Chem. 2 63:6384-63 91(198 8). 

DR EMBL; U17362; AAB60497.1; -. 

DR HSSP; P13551; 1FNM . 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR005225; Small_GTP . 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW GTP-binding; Protein biosynthesis. 



FT 


CONFLICT 


441 


441 


D 


-> E (IN REF. 1,3,2,2 AND 


FT 


VARIANT 


584 


584 


S 


-> G. 


FT 


VARIANT 


714 


714 


I 


-> N. 


FT 


VARIANT 


717 


717 


G 


-> R. 


FT 


VARIANT 


719 


719 


G 


-> D. 


SQ 


SEQUENCE 


8 58 AA; 


95310 


MW; 


3F103E20D69140D8 CRC64; 



Query Match 34.3%; Score 867; DB 11; Length 858; 

Best Local Similarity 37.3%; Pred. No. 5.1e-61; 

Matches 182; Conservative 94; Mismatches 190; Indels 22; Gaps 



Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 

I : : : I : I I I I I : ill I I : : I I I : : : I : I I 

Db 2 02 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMWAKFAAKGEGQLGPAERT^KVEDMMKKLW 2 61 

Qy 57 GDI YFNPKT RKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

II I I : I I I : I I : : I : I : I I : I : : I : : : : : : : I I 

Db 2 62 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 321 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I I : : | | | : | : : : : | I : I I I I : | | I | : 

Db 322 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 3 81 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I I I M I I : : I I II : I : I I I I I I : I I : : : I I I I : I I 
Db 382 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 441 

Qy 2 34 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 2 93 

: : I : : IE : I I I I I : I I I I : I I I I I I II I : I 

Db 442 L YLKP I QRT I LMMGRYVEP I EDVPCGN I VGLVGVDQFLVKT GT ITT FEHAHNMRVMK 498 

Qy 294 FNTTSVTKIAVE PVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I I I I I : : I I 1 : : : I I : : : II I : : I I I I I I : I I I I I : I : : 
Db 499 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 558 

Qy 354 HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II : : : I II : I I I I : : III I I : : I : : : I I I I : : I I I I I I I I I : 

Db 559 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 618 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARS IWAFGPDATGPNI LVDDTLP S EVDKALLGS 473 

I I : I I : II : II II MM MUM I I : I 
Db 619 KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 674 

Qy 474 VKDSIVQG 481 

: I I I : I I 
Db 675 IKDSWAG 682 



RESULT 14 
Q9BNW0 

ID Q9BNW0 PRELIMINARY; PRT; 727 AA. 

AC Q9BNW0; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Elongation factor-2 (Fragment) . 

OS Peripatus sp. * Per2 1 . 

OC Eukaryota; Metazoa; Onychophora; Peripatidae; Peripatus. 

OX NCBI_TaxID=109759 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=21317060; PubMed=11421654 ; 

RA Regier J.C., Shultz J.W.; 

RT "Elongation factor-2: a useful gene for arthropod phylogenetics . " ; 

RL Mol. Phylogenet. Evol . 2 0:136-148(2001). 

DR EMBL; AF240835; AAK12360.1; -. 

DR HSSP; P13551; 1FNM. 



DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR000795; EF__GTPbind. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR PROSITE; PS00301; EFACTORJGTP; 1. 

KW GTP-binciing; Protein biosynthesis. 

FT NON_TER 1 1 

FT NON_TER 72 7 72 7 

SQ SEQUENCE 727 AA; 81394 MW; A7E368E3CFDE8FDB CRC64; 

Query Match 34.2%; Score 863; DB 5; Length 727; 

Best Local Similarity 37.6%; Pred. No. 8.4e-61; 

Matches 182; Conservative 101; Mismatches 189; Indels 12; Gaps 4 

Qy 2 AYYSTDE NLILSPLLGNVCFSSSQYSI CFTLGSFAKI YADTFGDINYQEFAKRLWG 57 

III: : : : I III I : ill I I : I I : : I I : : I I I I I 

Db 187 ATYSDESGPMGDIKVDPSKGNVGLGSGLHGWAFTLKQFAEIYSEKF— KIDVDKLMKRLWG 245 

Qy 58 DI YFNPKT RKFTKKAPTS S SQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTK 117 

: : : M I I I : : I I : : I : I I : I : I : i II : : : | : : | [ | 

Db 246 ENFYNPKARKWSKKCESEDYKRAFCMFVLDPIYKIFDAIMNYKKDETAKLLEKLNIVLKG 305 

Qy 118 EELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVG7VKPKIEHTYTGGVDSDLGEAM 177 

I : : : | | : : | : : : I I : I I I : : I II I : : : I 

Db 306 EDKDKDGKALLKIVMRTWLPAGEALLQMIALHLPSPVTAQRYRTELLYEGPQDDEVAISM 365 

Qy 17 8 SDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQIC 2 37 

: I : I I I I : : : I I II : I : I I I I I i : I I I : : : I I I : I I 

Db 366 KECNPQGPLIMYISKMVPTSDKGRFYAFGRVFSGLVSTGQKVRIMGPNYVPGKKEDLYEK 425 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 2 97 

: I : : I I : I I I I : I I I ! I I I I I I I : : I I : I | : : 

Db 426 AI Q RT I LMMGR YVEAI E D VP C GN I C GL VG VD Q F I VKT GT ITT FKDAHNLRVMKFSVS 482 

Qy 298 SVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

I : : : I I I I M : I I I : : : I I : : : : I I I : : I I I I I I : I I 11 I : I : : II 
Db 483 PWRVAVEAKNPSDLPKLVEGLKRLSKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLE 542 

Qy 358 KMYSEIDIWADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEW 417 

: : : I I I I : I I I I : : I I I : I I : : : I I I I : : I I I : Mill: I 
Db 543 EDHAGI PI KVS DPVVS YRETVSDES DTMCLSKS PNKHNRLFMKAVPMPDGLPEDI DKGEV 602 

Qy 418 QITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDS 477 

I : I I I : i : I I II I II : : I I I I I I I I : I MM 

Db 603 SPKGEFKARARYLGEKYDYDVSEARKIWCFGPEGSGPNILVDCTKGVQ YLNEIKDS 65 8 

Qy 478 IVQG 481 

: I I 

Db 659 WAG 662 



RESULT 15 
Q95P39 

ID Q95P39 PRELIMINARY; PRT; 844 AA. 



AC Q95P39; 

DT 01-DEC-2 001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel . 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Elongation factor 2. 

GN EF-2 . 

OS Aedes aegypti (Yellowfever mosquito) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Nematocera; Culicoidea; Aedes. 

OX NCBI_TaxID=7159; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=MOYO-R; TISSUE=Midgut ; 

RA Morlais I., Severson D.W.; 

RT "A targeted approach for the identification of genes determining 

RT Plasmodium gallinaceum susceptibility in Aedes aegypti."; 

RL Submitted (JUN-2 001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY040342; AAK77225.1; -. 

DR Inter Pro; I PRO 00 64 0; EFG_C . 

DR Inter Pro; I PRO 055 17; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; I PRO 0 07 95; EF_GTPbind. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1 . 

DR Pfam; PF03144; GTP_EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR PROSITE; PS00301; EFACTORJGTP; 1. 

KW GTP-binding; Protein biosynthesis . 

SQ SEQUENCE 844 AA; 94378 MW; AE35C96A2 8 667F56 CRC64; 



Query Match 34.1%; Score 862.5; DB 5; Length 844; 

Best Local Similarity 37.9%; Pred. No. 1.2e-60; 

Matches 178; Conservative 104; Mismatches 179; Indels 9; Gaps 4 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 

: I I : I I I : II! I I : : M I I : : I 1 I I : : I I I I I : I : I 
Db 2 08 VDPSKGSVGFGSGLHGWAFTLKQFAEMYAAMF-KIDWKLMNRLWGENFFNPKTKKWA-K 2 65 



Qy 72 APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 131 

: : I M I : : I : I : I I : : : : I : I : : : : I I : : : I I : : | 

Db 2 66 TKDDDNKRSEVMYVLDPIYKVFDAIMNYKTDEIPKLLEKIKVTLKHEDKDKDGKNLLKW 325 



Qy 132 CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 191 

: : : I I : I II I I : : I II i : I : : I I I : I I I I = : 

Db 326 MRSWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAVAVKNCDPEGPLMMYVS 385 

Qy 192 KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 251 

II IE : I : I I I I : I : II : : : I I I I : I I : I : : I I 

Db 386 KMVPTSDKGRFYAFGRVFAGKVATGQKCRIMGPNYTPGKKEDLYEKAIQRTILMMGRYVE 445 

Qy 252 EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 311 

: I I I I : I I I I : I I I I I : : : I : : i I : : I : : : I t I I I I : : 

Db 446 AI EDVP C GN I CGL VGVDQ FLVKT GT 1ST FKDAHNMKVMKFSVS PWRVAVEPKNPAD 502 

Qy 312 LPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADPV 371 

I I I : : : I I : : : II I : : M I I i I : I I I I I : I : : II : : : I : I MM 



Db 503 LPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHACIPLKKSDPV 562 

Qy 372 VTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFFQ 431 

I : : I I I : I I : : : i M I : : ! I I : I I I I I I : I I : I : 

Db 563 VSYRETVSDESDQMCLSKSPNKHNRLFMKAVPMPDGLAEDIDNGDVNSRDDF?CVRARYLA 622 

Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

i I I : I : 1 I I I I I M I I I I I : I I I : I : I M : I I 

Db 623 EKYDYDVTE7VRKIWCFGPDGTGPNIWDCTKGVQ YLNEIKDSWAG 668 



Search completed: January 30, 2004, 11:26:21 
Job time : 59.0837 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



January 30, 2004, 11:15:32 ; Search time 18.716 Seconds 

(without alignments) 

1208.586 Million cell updates/sec 

US-09-989-481-3 
2527 

1 RAYYSTDENLILSPLLGNVC LPSEVDKALLGSVKDS IVQG 481 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



127863 



Database : 



SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



o . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


2491 


98 


6 


972 


1 


U5S INHUMAN 


Q15029 


homo sapien 


2 


2487 


98 


4 


971 


1 


U5Sl_MOUSE 


008810 


mus musculu 


3 


870 


34 


4 


857 


1 


EF2__HUMAN 


P13639 


homo sapien 


4 


869 


34 


4 


857 


1 


EF2_MOUSE 


P58252 


mus musculu 


5 


868 


34 


3 


857 


1 


EF2_RAT 


P05197 


rattus norv 


6 


867 


34 


3 


857 


1 


EF2 CRIGR 


P09445 


cricetulus 


7 


865 


34 


2 


857 


1 


EF2 CHICK 


Q90705 


gallus gall 


8 


863 


34 


2 


857 


1 


EF2_MESAU 


P05086 


mesocricetu 


9 


854 


33 


8 


832 


1 


EF2 CRYPV 


Q23716 


cryptospori 


10 


844.5 


33 


4 


843 


1 


EF2_DROME 


P13060 


drosophila 


11 


839 


33 


2 


842 


1 


EF2 CANAL 


013430 


Candida alb 


12 


837.5 


33 


1 


851 


1 


EF2 CAE EL 


P29691 


caenorhabdi 


13 


823.5 


32 


6 


840 


1 


EF2 ENTHI 


Q06193 


entamoeba h 


14 


816.5 


32 


3 


843 


1 


EF2 BETVU 


023755 


beta vulgar 


15 


807.5 


32 


0 


842 


1 


EF2_YEAST 


P32324 


saccharomyc 


16 


801.5 


31 


7 


842 


1 


EF2 SCHPO 


014460 


schizosacch 


17 


788 . 5 


31 


2 


844 


1 


EF2 NEUCR 


Q96x45 


neurospora 



18 


779 


30. 


8 


845 


1 


EF2_CHLKE 


P28996 


chlorella k 


19 


751 


5 


29. 


7 


867 


1 


EF2_BLAHO 


Q17152 


blastocyst! 


20 


710 


5 


28 . 


1 


830 


1 


EF2 DICDI 


P15112 


dictyos teli 


21 


668 


5 


26. 


5 


1008 


1 


SN14 YEAST 


P36048 


saccharomyc 


22 


404 


5 


16. 


0 


736 


1 


EF2_AERPE 


Q9ycl9 


aeropyrum p 


23 


396 


5 


15. 


7 


732 


1 


EF2_PYRHO 


059521 


pyrococcus 


24 


390 


5 


15. 


5 


728 


1 


EF2__ARCFU 


028385 


archaeoglob 


25 


390 


5 


15. 


5 


1110 


1 


YNQ3_YEAST 


P53893 


saccharomyc 


26 


385 


5 


15. 


3 


732 


1 


EF2 PYRAB 


Q9vlz8 


pyrococcus 


27 


385 


5 


15. 


3 


732 


1 


EF2_PYRFU 


P29050 


pyrococcus 


28 


371 


5 


14. 


7 


740 


1 


EF2 PYRAE 


Q8zzcl 


pyrobaculum 


29 


370 


14 . 


6 


736 


1 


EF2_SULTO 


Q975h5 


sulf olobus 


30 


366 


5 


14. 


5 


735 


1 


EF2_SULSO 


P30925 


sulf olobus 


31 


335 


13. 


3 


736 


1 


EF2_SULAC 


P23112 


sulf olobus 


32 


331 


13. 


1 


726 


1 


EF2_METJA 


Q58448 


methanococc 


33 


326 


5 


12 . 


9 


730 


1 


EF2_METTH 


027131 


methanobact 


34 


326 


5 


12 . 


9 


732 


1 


EF2 THEAC 


P26752 


thermoplasm 


35 


322 


5 


12 . 


8 


734 


1 


EF2_DESMO 


P33159 


desul f uroco 


36 


318 


5 


12 . 


6 


727 


1 


EF2 METVA 


P09604 


methanococc 


37 


305 


5 


12 . 


1 


730 


1 


EF2_METBU 


093632 


methanococc 


38 


303 


12 . 


0 


728 


1 


EF2 HAL HA 


P14823 


halobacteri 


39 


296 


11. 


7 


730 


1 


EF2_METMA 


Q8pur7 


methanosarc 


40 


295 


11 . 


7 


730 


1 


EF2_METTE 


093640 


methanosarc 


41 


290 


11. 


5 


730 


1 


EF2_METAC 


Q8trc3 


methanosarc 


42 


2i 


35 


11. 


3 


730 


1 


EF2_METMT 


093637 


methanococc 


43 


225 


5 


8 . 


9 


698 


1 


EFG_DEIRA 


Q9rxk5 


deinococcus 


44 


219 


8 . 


7 


695 


1 


EFG LISMO 


Q8y421 


listeria mo 


45 


216 


8 . 


5 


695 


1 


EFG LISIN 


Q92715 


listeria in 



ALIGNMENTS 



RESULT 1 
U5S INHUMAN 

ID U5S1_HUMAN STANDARD; PRT; 972 AA. 

AC Q15 02 9; Q9BUR0; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE 116 kDa U5 small nuclear ribonucleoprotein component (U5 snRNP- 

DE specific protein, 116 kDa) (U5-116 kDa) . 

GN SNRP116 OR KIAA0031. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=960 6; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Bone marrow; 

RX MEDLINE=9 6 05138 7; PubMed=7 5 84026; 

RA Nomura N., Miyajima N., Sazuka T., Tanaka A., Kawarabayasi Y. , 

RA Sato S., Nagase T., Seki N., Ishikawa K.-I., Tabata S.; 

RT "Prediction of the coding sequences of unidentified human genes. I. 

RT The coding sequences of 40 new genes (KIAA0 001-KIAA004 0 ) deduced by 

RT analysis of randomly sampled cDNA clones from human immature myeloid 

RT cell line KG-1." ; 



RL DNA Res. 1:2 7-35(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Muscle; 

RX MEDLINE=223 8 82 57; PubMed=12 47 7 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns tein M.J., Usdin T.B., Toshiyuki S., Carninci P . , Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-169 03(2002). 

RN [3] 

RP CHARACTERIZATION. 

RX MEDLINE=9 7377 04 7; PubMed=92 3 38 18 ; 

RA Fabrizio P., Laggerbauer B., Lauber J., Lane W.S., Luhrmann R. ; 

RT "An evolutionarily conserved U5 snRNP-specific protein is a GTP- 

RT binding factor closely related to the ribosomal translocase EF-2."; 

RL EMBO J. 16:4092-4106(1997). 

CC -!- FUNCTION: COMPONENT OF THE U5 SNRNP COMPLEX REQUIRED FOR PRE-MRNA 

CC SPLICING. BINDS GTP. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; D21163; BAA04699.1; -. 

DR EMBL; BC002360; AAH02360.1; 

DR SWISS-2DPAGE; Q15029; HUMAN. 

DR GK; Q15029; 

DR MIM; 603892; -. 

DR GO; GO: 0005681; C : spliceosome complex; TAS . 

DR GO; GO:0003924; F : GT P a s e activity; TAS. 

DR GO; GO: 0008248; F : pre-mRNA splicing factor activity; TAS. 

DR GO; GO: 0006371; P : mRNA splicing; TAS. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG C. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



InterPro; IPR005517; EFG_IV. 
InterPro; IPR004161; EFTU_D2 . 
InterPro; IPR005225; Small_GTP . 
Pfam; PF00679; EFG_C; 1. 
Pfam; PF03764; EFG_IV; 1. 
Pfam; PF00009; GTP_EFTU; 1. 
Pfam; PF03144; GTP_EFTU_D2; 1. 
PRINTS; PR00315; ELONGATNFCT. 
TIGRFAMs; TIGR00231; small_GTP; 
PROSITE; PS00301; E FACTOR GTP; 



FALSE NEG. 



GTP-binding; mRNA splicing; Nuclear protein; Polymorphism. 

NP_BIND 136 143 GTP (POTENTIAL) . 

NP_BIND 204 208 GTP (POTENTIAL) . 

NP_BIND 258 261 GTP (POTENTIAL) . 

VARIANT 773 773 G -> V (IN dbSNP : 1056505 ) . 

/FTId=VAR_014931 . 
CONFLICT 321 321 G -> V (IN REF . 2). 

SEQUENCE 972 AA; 109435 MW; 8 62BD6CA7 993F118 CRC64; 



Query Match 98.6%; 
Best Local Similarity 99.4%; 
Matches 475; Conservative 



Score 2491; DB 1; Length 972; 
Pred. No. 3.5e-175; 
1; Mismatches 2; Indels 0; 



Gaps 



0 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 



4 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNP 63 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I II I I I I I I I i I I I I I I I I I I I I I 1 I I I I 

2 92 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDIYFNP 351 

64 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 123 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I t I I I It I M M I I I I I I I I 1 I I I I 

352 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 411 

124 IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 183 

I I I I II I I I I I I I I I I I I I I I II I I I I I M M I I I I I I I I I M E 1 I I I I I I I I I I i I I I I 

412 IRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPD 471 



184 



243 



GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 

I I I I I I I I I I : i I II I I I I 1 I I 1 I I I I I II I M I I I I I I I I I I I I I I I I I I I I I M I I 
472 GPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 531 

244 ISVT^YHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

I I I I I M I I I I I I I I I I II I I I I I I I I 11 i I I IE I I I I I I I I I I I I I I I I I I I I I I I I I I 
532 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 591 

304 VEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 3 63 

I I M I I I I I I I I I I I I I I I I 1 I i I I I I ! I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
5 92 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 651 

364 DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNR 423 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I 
652 DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNR 711 

424 KKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I I I I I I 1 I I I I | I I I It i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 
712 KKLGEFFQTKYDWDLLAARS I WAFGPDAT GPNI LVDDTL P S EVDKALLGSVKDS I VQG 7 69 



RESULT 2 



U5Sl_MOUSE 

ID U5Sl_MOUSE STANDARD; PRT; 971 AA. 

AC 008810; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE 116 kDa U5 small nuclear ribonucleoprotein component (U5 snRNP- 

DE specific protein, 116 kDa) (U5-116 kDa) . 

GN SNRP116. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=9 737 7047 ; PubMed=9233 818 ; 

RA Fabrizio P. , Laggerbauer B., Lauber J., Lane W.S., Luehrmann R. ; 

RT "An evolutionarily conserved U5 snRNP-speci f ic protein is a GTP- 

RT binding factor closely related to the ribosomal translocase EF-2."; 

RL EMBO J. 16:4092-4106(1997). 

RN [2] 

RP SEQUENCE OF 401-971 FROM N.A. 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B . , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

cc _(_ FUNCTION: COMPONENT OF THE U5 SNRNP COMPLEX REQUIRED FOR PRE-MRNA 

CC SPLICING. BINDS GTP. 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 

CC EF-G/EF-2 SUBFAMILY . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 



DR EMBL; U97079; AAC53299.1; 

DR EMBL; BC012636; AAH12636.1; -. 

DR MGD; MGI : 1336880; Snrpll6. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_E FTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; E FACTOR GTP; FALSE NEG . 



KW 


GTP-binding; mRNA 


splicing; 


Nuclear protein. 


FT 


NP_BIND 


135 


142 


GTP ( POTENTIAL) . 


FT 


NP_BIND 


203 


207 


GTP (POTENTIAL) . 


FT 


NP_BIND 


257 


260 


GTP (POTENTIAL) . 


FT 


CONFLICT 


401 


404 


HLTK -> PRVR (IN REF. 2) . 


SQ 


SEQUENCE 


971 AA; 


109360 


MW; 0ECF1661DEA3A7FC CRC64 ; 



Query Match 98.4%; Score 2487; DB 1; Length 971; 

Best Local Similarity 99.2%; Pred. No. 6.9e-175; 

Matches 474; Conservative 2; Mismatches 2; Indels 0; Gaps 0; 

Qy 4 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNP 63 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I M I I 
Db 291 YSTDENLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNP 350 

Qy 64 KTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 123 

I I M II M I I : I I I I I I I I I I I i I I I I I I i I I I I I I I I I I I I I I I I I i I I I I I I I I I M I 

Db 351 KTRKFTKKAPSSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLN 410 

Qy 124 I RPLLRLVCKKFFGEFTGFVDMCVQHI P S PKVGAKPKI EHTYTGGVDS DLGEAMS DCDPD 183 

II I I t I I I i I I I I I It I I I I M I I I I I I I I I I I I I I I I I I I i I M I I I I I I I I I I I I I I I 

Db 411 I RPLLRLVCKKFFGEFTGFVDMCVQHI PS PKVGAKPKI EHTYTGGVDSDLGEAMS DCDPD 470 

Qy 184 GPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 243 

1111111111:11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I ! I I I I II I I I 
Db 471 GPLMCHTTKMYSTDDGVQFHAFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLW 530 

Qy 244 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 303 

I I I I I I I I I II I I I I I I I I M I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I 
Db 531 ISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIA 590 

Qy 304 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 363 

I I I I I I I I I I I I I I I I I I I I II II I i I I I I I I I I I I I I II I I I I I 1 I I I I I I I I I I I I I I 
Db 591 VEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEI 650 

Qy 364 DIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGI^DIENEWQITWNR 423 

I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I i I I I I 
Db 651 DIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNR 710 

Qy 424 KKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
Db 711 KKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 768 



RESULT 3 
EF2 HUMAN 



ID EF2_HUMAN STANDARD; PRT; 857 AA. 

AC P13639; 

DT 01-JAN-1990 (Rel. 13, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

GN EEF2 OR EF2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=9 0 12 174 1 ; PubMed=2 610926; 

RA Rapp G. , Klaudiny J., Hagendorff G. , Luck M.R., Heinz K. ; 

RT "Complete sequence of the coding region of human elongation factor 2 

RT (EF-2) by enzymatic amplification of cDNA from human ovarian 

RT granulosa cells."; 

RL Biol. Chem. Hoppe-Seyler 370:1071-1075(1989). 

RN [2] 

RP SEQUENCE OF 500-857 FROM N.A. 

RX MEDLINE=8 8 2 93714; PubMed=2 840927; 

RA Rapp G., Mucha J., Einspanier R. , Luck M., Scheit K.H.; 

RT "Cloning and sequence analysis of a cDNA from human ovarian granulosa 

RT cells encoding the C-terminal part of human elongation factor 2."; 

RL Biol. Chem. Hoppe-Seyler 369:24 7-250(198 8). 

RN [3] 

RP CLEAVAGE OF INITIATOR METHIONINE. 

RA Bienvenut W.V. ; 

RL Unpublished observations (AUG-2001) . 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2. 

CC -!- PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE ( 2- [ 3-CARBOXYAMIDO-3- 
CC (TRIMETHYL-AMMONIO) PROPYL] HISTIDINE) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A. 

CC -!- SIMILARITY: BELONGS TO THE GTP -BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M19997; AAA50388.1; -. 

DR EMBL; X51466; CAA35829.1; -. 

DR EMBL; Z11692; CAA77750.1; -. 



DR PIR; S18294; EFHU2 . 

DR HSSP; P13551; 1FNM. 

DR PMMA-2DPAGE; P13639; -. 

DR Genew; HGNC:3214; EEF2 . 

DR MIM; 130610; -. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF0 0 67 9; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; E FACT O R_GT P ; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 

FT INIT_MET 0 0 

FT NP_BIND 25 32 GTP (BY SIMILARITY) . 

FT NP_BIND 103 107 GTP (BY SIMILARITY) . 

FT NP_BIND 157 160 GTP (BY SIMILARITY) . 

FT MOD_RES 56 56 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD__RES 58 58 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 714 714 DIPHTHAMIDE. 

SQ SEQUENCE 857 AA; 95207 MW; 66421C8133EAC22E CRC64; 



Query Match 34.4%; Score 870; DB 1; Length 857; 

Best Local Similarity 37.3%; Pred. No. 3.9e-56; 

Matches 182; Conservative 95; Mismatches 189; Indels 22; 



Gaps 



Qy 

Db 



9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 

I : : : I : I I I I I : III I I : : I I I : : : I : I I 

2 01 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 2 60 



QY 
Db 

Qy 

Db 

Qy 

Db 



57 GDI YFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I I I I : I I t : I I : : | : | : || : | : : I : : : : : : : ! | 

2 61 GDRYFDPANGKFSKSATSPEGKKLPRTFCQLILDPIFKVFDAIMNFKKEETAKLIEKLDI 32 0 

114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPICVG7VKPKIEHTYTGGVDSDL 173 

I I : : M I : I : : : : I I : I I I I : I II 1 : 

321 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSP VTAQKYRCELLYEGPPDDEA 380 

174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I I I I I II : : I I II : I : I I II I I : I I : : : I I I I : I I 

381 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 44 0 



Qy 

Db 

Qy 

Db 

Qy 



234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: : I : : I : I I I I I : I I I I : I I I I M II I : I 

441 LYLKPIQRTILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITT FEHAHNMRVMK 497 

2 94 FNTTSVIKIAVEPVNPSELPKMLDGLRKA/NKS YPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I I I I | : : | | | : : : I | : : : II I : : I I I I I I : I I I I I : I : : 
498 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 557 

354 HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 
II : : : I II : I I I I : : III I I : : I : : : I I I I : : I I I I I I I I I : 



Db 558 KDLEEDHACI PIKKSDPVVSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 617 



Qy 414 NEVVQITWNRKKLGEFFQTKYDWDLIAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : : I I : I I : I I I I I I I I I I I I I I I i : I 

Db 618 KGEVSARQELKQRARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 673 

Qy 474 VKDSIVQG 481 

: If I : I I 
Db 674 IKDSWAG 681 



RESULT 4 
EF2 MOUSE 



ID EF2_MOUSE STANDARD; PRT ; 857 AA. 

AC P58252; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 {Rel. 42, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

GN EEF2 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=2 23 8 82 57; PubMed=12 4 7 7 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L . , Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S . J" . , 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S. f Garcia A.M., Gay L . J . , Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X . , Gibbs R.A. , 

RA Fahey J,, Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I. - , Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 
CC (By similarity) . 

CC -!- PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE ( 2- [ 3-CARBOXYAMIDO-3- 
CC (TRIMETHYL-AMMONIO) PROPYL ] HI STI DINE ) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A (By 

CC similarity) . 



CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement {See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; BC007152; AAH07152.1; -. 

DR MGD; MGI: 95288; Eef2. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG__C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; I PRO 0522 5; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis ; Phosphorylation . 



FT 


INITJVIET 


0 


0 


BY SIMILARITY. 


FT 


NP_BIND 


25 


32 


GTP (BY SIMILARITY) . 


FT 


NP_BIND 


103 


107 


GTP (BY SIMILARITY) . 


FT 


NP_BIND 


157 


160 


GTP (BY SIMILARITY) . 


FT 


MOD_RES 


56 


56 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


58 


58 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


714 


714 


DIPHTHAMIDE (BY SIMILARITY) . 


SQ 


SEQUENCE 


857 AA; 


95182 


MW; 1B96D4 6A8B9B7A44 CRC64; 



Query Match 34.4%; Score 869; DB 1; Length 857; 

Best Local Similarity 37.7%; Pred. No. 4.6e-56; 

Matches 184; Conservative 93; Mismatches 189; Indels 22; Gaps 5; 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDINYQEFA KRLW 56 

1 : : : I : I I I I I : III I I : : I I I : : II I : I I 

Db 201 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLSAAERAKKVEDMMKKLW 260 

Qy 57 GDIYFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I I I I : I | | : | I : : I : I : I I : I : : I : : : : : : : I I 

Db 2 61 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 32 0 



Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I I : : | | | : | : : : : | I : I I I I : I II I : 

Db 321 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 380 

Qy 17 4 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQ PVKVLGENYTLEDEED 233 

: I I I I I I I : : I I II : I : I I I I I I : I I : : : I I I I =11 
Db 381 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 440 

Qy 234 S Q I CT VGRLW I S VAR YH I E VN RVP AGNWVL I EGVDQ P I VKT AT I T E P RGN EEAQ I FRP L K 293 

: : I : : I I : I I M I : I II I : I I I I I I II I : I 



Db 



441 L YLKP I QRT I LMMGRYVE P I EDVPCGN I VGLVGVDQFLVKTGT ITT FEHAHNMRVMK 497 



Qy 294 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I II | | : : I | I : : : I I : : : II I : : I M I I I : I I I i I : I : : 
Db 498 F S VS P VVRVAVEAKN PAD L P KLVE GL KRLAK S D PMVQ CIIEESGEHI I AGAG ELHLEICL 557 

Qy 354 HDLRKMYSEIDIPCVADPVVTFCETVVETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II : : : I II : I I I I : : III I I : : I : : : I I I I : : I I I MINI: 
Db 558 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 617 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : I I : I I : I I M I II I I I I II I I I : I 
Db 618 KGEVSARQELKARARYIAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 673 



Qy 


474 VKDSIVQG 481 
: 1 1 1 : 1 1 




Db 


674 I KD SWAG 681 




RESULT 5 




EF2 


RAT 




ID 


EF2 RAT STANDARD; PRT; 8 57 AA. 




AC 


P05197; P97619; 




DT 


13-AUG-1987 (Rel. 05, Created) 




DT 


16-OCT-2001 (Rel. 40, Last sequence update) 




DT 


28-FEB-2003 (Rel. 41, Last annotation update) 




DE 


Elongation factor 2 (EF-2) . 




GN 


EEF2 . 




OS 


Rattus norvegicus (Rat) . 




oc 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 


oc 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 


Rattus . 


ox 


NCBI TaxID=10116; 




RN 


[1] 




RP 


SEQUENCE OF 1-693 FROM N.A. 




RC 


TISSUE=Liver; 




RX 


MEDLINE=8 9252 02 8 ; PubMed-272 1 670 ; 




RA 


Oleinikov A. V. , Jokhadze G.G., Alakhov Y.B.; 




RT 


"Primary structure of rat liver elongation factor 2 deduced from the 


RT 


cDNA sequence . " ; 




RL 


FEBS Lett. 24 8:131-136(198 9). 




RN 


[2] 




RP 


SEQUENCE OF 515-857 FROM N.A. 




RX 


MEDLINE=8 6259716; PubMed=30 14523 ; 




RA 


Kohno K. , Uchida T., Ohkubo H., Nakanishi S., Nakanishi T . , 




RA 


Fukui T., Ohtsuka E . , Ikehara M. , Okada Y.; 




RT 


"Amino acid sequence of mammalian elongation factor 2 deduced 


from 


RT 


the cDNA sequence: homology with GTP-binding proteins. 




RL 


Proc. Natl. Acad. Sci. U.S.A. 83:4 978-4 982(1986). 




RN 


[3] 




RP 


SEQUENCE OF 507-557 FROM N.A. 




RC 


STRAIN=Wistar Kyoto; TISSUE=Aortic smooth muscle; 




RA 


Adams L.A., Werny I., Schwartz S.M.; 




RL 


Submitted (OCT-1996) to the EMBL/GenBank/DDB J databases . 




RN 


[4] 




RP 


SEQUENCE OF 549-857 FROM N.A. 




RC 


STRAIN=Noble; 




RA 


Too C.K.L. ; 





RL Submitted (MAY-1997) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE OF 701-715. 

RC TISSUE=Liver; 

RX MEDLINE=7 4 3 012 60; PubMed=4 368673; 

RA Robinson E.A., Henriksen O., Maxwell E.S.; 

RT "Elongation factor 2. Amino acid sequence at the site of adenosine 

RT diphosphate ribosylation. "; 

RL J. Biol. Chem. 24 9:5088-5093(1974). 

RN [6] 

RP SEQUENCE OF 2-18 AND 50-59, AND PHOSPHORYLATION ♦ 

RC TISSUE=Pancreas ; 

RX MEDLINE=8 8 087 003; PubMed=3693353 ; 

RA Nairn A.C., Palfrey H.C.; 

RT "Identification of the major Mr 100,000 substrate for calmodulin- 

RT dependent protein kinase III in mammalian cells as elongation 

RT factor-2 . "; 

RL J. Biol. Chem. 2 62:172 99-17 303(1987). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2. 

CC -!- PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE ( 2 - [ 3 — CARBOXYAMI DO- 3 - 
CC (TRIMETHYL-AMMONIO) PROPYL] HISTIDINE) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; K03502; AAA41106.1; -. 

DR EMBL; Y07504; CAA68805.1; -. 

DR EMBL; U75403; AAB19107.1; 

DR EMBL; AF000576; AAD05363.1; -. 

DR PIR; S 04 0 07; EFRT2 . 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP . 

DR Pfam; PF0 0 67 9; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU__D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 

FT INIT MET 0 0 BY SIMILARITY. 



FT NP_BIND 


25 


32 




GTP (BY SIMILARITY) . 


FT NP_BIND 


103 


107 




GTP (BY SIMILARITY) . 


FT NP_BIND 


157 


160 




GTP (BY SIMILARITY) . 


FT MOD_RES 


56 


56 




PHOSPHORYLATION (BY SIMILARITY) 


FT MOD_RES 


58 


58 




PHOSPHORYLATION (BY SIMILARITY) 


FT MOD_RES 


714 


714 




DIPHTHAMIDE. 


SQ SEQUENCE 


857 AA; 


95152 


MW; C808AD8ABD527E85 CRC64; 


Query Match 




34. 


3 5 


h; Score 868; DB 1; Length 857; 


Best Local 


Similarity 


37. 


7 £ 


h; Pred. No. 5.4e-56; 



Matches 184; Conservative 92; Mismatches 190; Indels 22; Gaps 5 

Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDINYQEFA- KRLW 56 

I : : : i : I I I I I : Ml i I : : I I I : II I : I I 

Db 201 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGAAERAKKVEDMMKKLW 260 

Qy 57 GDIYFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I I I I : I I I : I I : : | : | : | | : | : : | : : : : : : : I I 

Db 2 61 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 32 0 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVG7VKPKIEHTYTGGVDSDL 173 

I I : : | M : I : : : : I I : I I ! I : I II I : 

Db 321 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 380 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I M I I I I : : II II : I : II I I I I : I I : : : I I I I : I I 
Db 381 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 440 

Qy 234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: : I : : I I : I I I I I : I I II : I M M I II I : I 

Db 441 L YLKP I QRT I LMMGRYVEP I EDVP CGNI VGLVGVDQ FLVKT GT ITT FEHAHNMRVMK 497 

Qy 294 FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I I I | | : : | | | : : : | | : : : II I : : I I i I I i : I I I I I : i : : 
Db 498 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 557 

Qy 354 HDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

II : : : I II : I 1 I I : : III I I : : I : : : I I I I : : I II I I I I I I : 

Db 558 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGL7VEDID 617 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : I I : I I : I I I I I I I I I I I I I I I I : I 

Db 618 KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 673 

Qy 474 VKDSIVQG 481 

: I I I : I I 
Db 67 4 IKDSWAG 681 



RESULT 6 
EF2_CRIGR 

ID EF2_CRIGR STANDARD; PRT; 857 AA. 

AC P09445; 

DT 01-MAR-1989 (Rel. 10, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 
DE Elongation factor 2 (EF-2) . 



GN EEF2 . 

OS Cricetulus griseus (Chinese hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Cricetinae; 

OC Cricetulus. 

OX NCBI_TaxID=10 02 9; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=8 8 198187; PubMed=2 834376; 

RA Nakanishi T . f Kohno K., Ishiura M. , Ohashi H., Uchida T . ; 

RT "Complete nucleotide sequence and characterization of the 5 '-flanking 

RT region of mammalian elongation factor 2 gene."; 

RL J. Biol. Chem. 2 63:6384-6391(19 8 8). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2. 

cc _i_ PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE ( 2- [ 3-CARBOXYAMIDO-3- 
CC (TRIMETHYL-AMMONIO) PROPYL] HISTIDINE) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; J03200; AAA50386.1; -. 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; SmallJGTP. 

DR Pfam; PF0 067 9; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 

FT INIT__MET 0 0 BY SIMILARITY. 

FT NP_BIND 25 32 GTP (BY SIMILARITY) . 

FT NP_BIND 103 107 GTP (BY SIMILARITY) . 

FT NP_BIND 157 160 GTP (BY SIMILARITY) . 

FT MOD_RES 5 6 5 6 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD~RES 58 58 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 714 714 DIPHTHAMIDE. 

SQ SEQUENCE 857 AA; 95264 MW; DCE0A55CFBCB8 8 8 6 CRC64 ; 



Query Match 



34.3%; Score 867; DB 1; Length 857; 



Best Local Similarity 37.3%; Pred. No. 6.4e-56; 

Matches 182; Conservative 94; Mismatches 190; Indels 22; Gaps 5 



Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKIYADTF GDI NYQE FAKRLW 56 

I : : : I : II I I I : III M : : ! I I : : : I : I I 

Db 201 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFT^AKGEGQLGPAERAKKVEDMMKKLW 260 

Qy 57 GDIYFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I I I I : I M : I ! : : | : | : | | : | : : | : : : : : : : I I 

Db 261 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDPIMNFRKEETAKLIEKLDI 320 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I I : : | | | : | : : : : I I : I I I I : I II I : 

Db 321 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 380 

Qy 174 GE7\MSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I I I I I I I :: I I II : I : I I I j I I : I I : : : I I I I : I I 
Db 381 AMGIKSCDPKGPLMMYISKMVPTSDKGRFYAFGRVFSGWSTGLKVRIMGPNYTPGKKED 440 

Qy 234 SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 293 

: : I : : I I : II IE I : MM : I I I Ml II I M 

Db 441 LYLKPIQRTILMMGRYVEPIEDVPCGNIVGLVGVDQFLVKTGTITT FEHAKNMRVMK 497 

Qy 294 FNTTS VIKIAVE PVNPSELPKMLDGLRKVNKS YPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : I M I I : M I I : : M I : : : II I : M II II I M I M I : I : : 
Db 498 FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 557 

Qy 354 HDLRKMYSEIDIKVADPVWFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIE 413 

I I : : : I II =1111 = = III I I : : I : : : M I I : : i I I II I I ! I : 
Db 558 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 617 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : I I : I I : II II I I I I I I I II I I I : I 

Db 618 KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 673 

Qy 47 4 VKDSIVQG 4 81 

M I I : I I 
Db 67 4 IKDSWAG 681 



RESULT 7 


EF2 


CHICK 


ID 


EF2 CHICK STANDARD; PRT; 857 AA. 


AC 


Q90705; 


DT 


01-NOV-1997 (Rel. 35, Created) 


DT 


16-OCT-2001 (Rel. 40, Last sequence update) 


DT 


28-FEB-2003 (Rel. 41, Last annotation update) 


DE 


Elongation factor 2 (EF-2) . 


GN 


EEF2 . 


OS 


Gallus gallus (Chicken) . 


OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


OC 


Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 


OC 


Gallus . 


OX 


NCBI TaxID=9031; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RC 


TISSUE=Intestine; 



RA Kim C.W., Jung E.J., Ahn H.J., Kim J.C., Kang K.R., Eom M.-O., 

RA Kim Y.W., Kang Y.-S.; 

RT "Molecular cloning of chicken elongation factor 2 (EF-2): sequence 

RT comparison with mammalian EF-2 and its expression in the early 

RT developmental stages of the embryos."; 

RL Mol. Cells 3:27-33(1993). 

RN [2] 

RP SEQUENCE OF 1-21, AND PHOSPHORYLATION. 

RX MEDLINE=91207327; PubMed=1708237 ; 

RA Kim Y.W., Kim C.W., Kang K.R., Byun S.M., Kang Y.S.; 

RT "Elongation factor-2 in chick embryo is phosphorylated on tyrosine as 

RT well as serine and threonine."; 

RL Biochem. Biophys . Res. Commun. 17 5:4 00-4 06(1991). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 
CC (BY SIMILARITY) . 

CC -!- PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE ( 2- [ 3-CARBOXYAMIDO-3- 
CC (TRIMETHYL-AMMONIO) PROPYL] HISTIDINE) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A (BY 

CC SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC J 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U46663; AAA87587.1; -. 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG__IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF0067 9; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTORJGTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 

FT INIT_MET 0 0 

FT NP_BIND 25 32 GTP (BY SIMILARITY) . 

FT NP_BIND 103 107 GTP (BY SIMILARITY) . 

FT NP_BIND 157 160 GTP (BY SIMILARITY) . 

FT MOD_RES 56 56 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 58 58 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 714 714 DIPHTHAMIDE (BY SIMILARITY) . 

SQ SEQUENCE 857 AA; 95247 MW; 368018758 1F518E6 CRC64 ; 



Query Match 34.2%; Score 865; DB 1; Length 857; 

Best Local Similarity 37.5%; Pred. No. 9e-56; 

Matches 183; Conservative 94; Mismatches 189; Indels 22; Gaps 



Qy 


9 


NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 

l . . . !>i| 1 1 [ • III 1 I * • I 1 || * * I'll 
I ... I . | | | I l • 111 I I • * I I II • • i-ii 

N IMI D P VL GT VG FGS GLHGWAFT LKQ FAEM YVAK FAAKGDAQMN P T ERAKKVEDMMKKLW 


56 


Db 


201 


260 


Qy 


57 


GDI YFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKI LAQWGDVDTSLPRTLDELGI 

1 | 1 1 . 1 1 | ] . | 1 . 1 * 1 * 1 1 * 1 • ■ 1 * . . .... | 1 

I l | | . | | I I . I I • 1*1 • i i • i • . i • * • .... t i 

GDRYFDPATGKFSKSATGPDGKKLPRTFCQLILDPIFKVFDAIMTFKKEEAAKLIEKLDI 


113 


Db 


261 


320 


Qy 


114 


HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 

i i • .iii.i... i 1*111 1 • 1 II 1 * 
I | . * I I I . [ . . • 1 1*111 1 • 1 II 1 * 

KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 


173 


Db 


321 


380 


Qy 


174 


GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQ PVKVLGENYTLEDEED 

- I I I I II • <ll 1 1 * I * 1 1 1 1 II • 1 1***1 III * 1 1 
: . | | | I II . • 1 1 1 1 * 1 * 1 1 1 1 II • 1 1 * * * I ill -it 

AIGIKNCDPRGSLMMYISKMVPTSDKGRFYAFGRVFSGLVSTGLKVRIMGPNYTPGKKED 


233 


Db 


381 


440 


Qy 


234 


SQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLK 
• I | i • || | | | • I | | | : | | t 1 1 1 II 1 : 1 
L YLKP I QRT I LMMGRYVEP I EDVP CGNI VGLVG VDQFLVKT GT ITT FEHAHNMRVMK 


293 


Db 


441 


497 


Qy 


294 


FNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 

| : : | : : : M | I I : : 1 1 I : : : 1 1 : : : 11 1 : : 1 1 1 1 1 1 : 1 1 1 1 1 : 1 : : 
FSVSPWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 


353 


Db 


498 


557 


Qy 


354 


HDLRKMYSEIDIKVADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGL7VEDIE 

II : : : i I I : I i 1 1 : : II! 1 1 : : 1 : : : 1 1 1 1 : : 1 1 ! MINI: 
KDLEEDHACIPIKKSDPWSYRETVSEESNVMCLSKSPNKHNRLYMKARPFPDGLAEDID 


413 


Db 


558 


617 


Qy 


414 


NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 

I I : : II : 1 1 : 1 i 1 1 1 1 1 i 1 1 1 1 1 1 i 1 ' ! 

KGEVSARQELKQRARYLAEKYEWDA/T EARKIWCFGPDGTGPNILTDITKGVQ YLNE 


473 


Db 


618 


673 


Qy 


474 


VKDSIVQG 481 
: 1 M : 1 1 




Db 


674 


IKDSWAG 681 





RESULT 8 
EF2_MESAU 

ID EF2_MESAU STANDARD; PRT; 857 AA. 

AC P05086; 

DT 13-AUG-1987 (Rel. 05, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

GN EEF2 . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI_TaxID=10036; 

RN [1] 



RP SEQUENCE FROM N . A. 

RX MEDLINE=8 62597 16; PubMed=3 0 14523 ; 

RA Kohno K. , Uchida T . , Ohkubo H., Nakanishi S., Nakanishi T., 

RA Fukui T., Ohtsuka E . , Ikehara M. , Okada Y. ; 

RT "Amino acid sequence of mammalian elongation factor 2 deduced from 

RT the cDNA sequence: homology with GTP-binding proteins."; 

RL Proc. Natl. Acad. Sci. U.S.A. 83:4978-4 982(1986). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

cc _i_ PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 . 

CC -!- PTM: HIS-714 IS MODIFIED IN DIPHTHAMIDE (2- [3-CARBOXYAMIDO-3- 
CC (TRIMETHYL-AMMONIO) PROPYL] HI STIDINE) . DIPHTHAMIDE CAN BE ADP- 

CC RIBOSYLATED BY DIPHTERIA TOXIN AND BY PSEUDOMONAS EXOTOXIN A. 

CC -!- SIMILARITY: BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY . 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M13708; AAA50387.1; -. 

DR PIR; A25440; A25440. 

DR HSSP; P13551; 1FNM. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GT P_E FTU; 1. 

DR Pfam; PF03144; GTP__EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 

FT INIT_MET 0 0 BY SIMILARITY. 

FT NP_BIND 25 32 GTP (BY SIMILARITY) . 

FT NP_BIND 103 107 GTP (BY SIMILARITY) . 

FT NP_BIND 157 160 GTP (BY SIMILARITY) . 

FT MOD__RES 56 56 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 58 58 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD~RES 714 714 DIPHTHAMIDE. 

SQ SEQUENCE 857 AA; 95192 MW; 42356544F43947AB CRC64; 

Query Match 34.2%; Score 863; DB 1; Length 857; 

Best Local Similarity 37.1%; Pred. No. 1.3e-55; 

Matches 181; Conservative 95; Mismatches 190; Indels 22; Gaps 5; 
Qy 9 NLILSPLLGNVCFSSSQYSICFTLGSFAKI YADTF GDI NYQEFAKRLW 56 



Db 201 NIMIDPVLGTVGFGSGLHGWAFTLKQFAEMYVAKFAAKGEGQLGPAERAKKVEDMMKKLW 260 

Qy 57 GDI YFNPKTRKFTKKAPTSSSQ RSFVEFILEPLYKILAQWGDVDTSLPRTLDELGI 113 

I | I I : I I I : I I : : I : I : I I : I : : I : : : : : : : I I 

Db 2 61 GDRYFDPANGKFSKSANSPDGKKLPRTFCQLILDPIFKVFDAIMNFRKEETAKLIEKLDI 32 0 

Qy 114 HLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDL 173 

I | : : | | | : | : : : : I I : I I I I : 1 II I : 

Db 321 KLDSEDKDKEGKPLLKAVMRRWLPAGDALLQMITIHLPSPVTAQKYRCELLYEGPPDDEA 38 0 

Qy 174 GEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEED 233 

: I I I ! i I I : : I I II : I : i I I I I I : i I : : : I I I I : I : 
Db 381 AMGIKSCDPKGPLMMYI SKMVPTSDKGRFYAFGRVFSGVVSTGLKVRIMGPNYTPGKKEE 44 0 

Qy 234 S Q I C T VGRLW I S VARYH I EVN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN EEAQ I FRP L K 293 

: : I : : 1 I : I I I I I : I I I I : I I I I I I II I : I 

Db 441 L YL K P I Q RT I LMMG R YVE PIE D VP C GN I VGL VG VDQ F L VKT GT ITT FEHAHNMRVMK 497 

Qy 2 94 FNTTSVIKIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVM 353 

I : : I : : : M I I I : : I I I : : : I I : : : II I : : I I I I I I : I I I I I : I : : 
Db 498 FSVS PWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICL 557 

Qy 354 HDLRKMYSEIDIKVADPVVTFCETVVETSSLKCFAETPNKKNKITMI7VEPLEKGLAEDIE 413 

II : : : I I I : I I I I : : III I I : : I : : : I I I I : : I I I I I I I I I • 

Db 558 KDLEEDHACIPIKKSDPWSYRETVSEESNVLCLSKSPNKHNRLYMKARPFPDGLAEDID 617 

Qy 414 NEWQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGS 473 

I I : I I : I I : M M I I I I I I I II I I I : I 
Db 618 KGEVSARQELKARARYLAEKYEWDVAEARKIWCFGPDGTGPNILTDITKGVQ YLNE 673 

Qy 474 VKDSIVQG 481 

: 1 I I : I I 
Db 67 4 IKDSWAG 681 



RESULT 9 
EF2_CRYPV 

ID EF2_CRYPV STANDARD; PRT; 832 AA. 

AC Q23716; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

OS Cryptosporidium parvum. 

OC Eukaryota; Alveolata; Apicomplexa ; Coccidia; Eimeriida; 

OC Cryptosporidiidae; Cryptosporidium. 

OX NCBI_TaxID=5807 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=AUCP-1; 

RX MEDLINE=95356792; PubMed=7 63037 9 ; 

RA Jones D.E., Tu T.D., Mathur S., Sweeney R.W., Clark D.P.; 

RT "Molecular cloning and characterization of a Cryptosporidium parvum 

RT elongation factor-2 gene."; 

RL Mol. Biochem. Parasitol. 71:143-147(1995). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 



CC RIBOSOME (BY SIMILARITY) . 

CC SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U21667; AAC46607.1; -. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP . 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GT P_E FTU ; 1. 

DR Pfam; PF03144; GTP_EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; Protein biosynthesis; GTP~binding; Phosphorylation. 



FT 


NP BIND 


26 


33 


GTP (BY SIMILARITY) . 


FT 


NP BIND 


98 


102 


GTP (BY SIMILARITY) . 


FT 


NP_BIND 


152 


155 


GTP (BY SIMILARITY) . 


FT 


MOD RES 


57 


57 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


59 


59 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD RES 


689 


689 


DIPHTHAMIDE (BY SIMILARITY) . 


SQ 


SEQUENCE 


832 AA; 


92761 


MW; 577D2DE23D77E3FA CRC64; 


Query Match 




33. 8 £ 


h; Score 854; DB 1; Length 832; 



Best Local Similarity 37.3%; Pred. No. 5.5e-55; 

Matches 174; Conservative 99; Mismatches 175; Indels 18; Gaps 4; 

Qy 9 NLILS PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRL 55 

I : I : i I lilt : I I : I I : I I I i I : : : I I 

Db 180 NVI I STYS DELMGDVQVFPEKGTVS FGSGLHGWAFTI EKFARI YAKKFG-VEKS KMMQRL 238 

Qy 56 WGDIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHL 115 

Ml : i I I : I : t M I I : i : I : I I : I I : : : : : : : I I I : I 

Db 239 WGDNFFNPETKKFT-KTQEPGSKRAFCQFIMEPICQLFSSIMNGDKAKYEKMLVNLGVEL 297 

Qy 116 TKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGE 17 5 

: : I : I I I : I : : : : I I I : I I I f : : | : | | | : : 

Db 298 KGDDKALVDKPLLKKVMQLWLSAGDTLLEMIVTHLPSPAAAQKYRVENLYEGPQDDETAK 357 

Qy 17 6 AMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQ 235 

: : I I I I I E : I I II : I : I I I I I I I : II I : : I I : II 

Db 358 GIRNCDPDAPLCMFVS KMVPTSDKGRFYAFGRVFSGTVATGQKVRIQGPRYVPGGKEDLN 417 

Qy 236 ICTVGRLWISVARYHIEWRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFN 295 

I : I : : I I : : I M I ! I : I : I I : : I : I I I : I I : I : : 



Db 



418 I KNI QRT VLMMGRYVEQI PDVPAGNT VGLVGI DQYLLKS GT ITT SETAHNIASMKYS 474 



Qy 296 TTSVIKIAVEPWPSELPKMLDGLRPCVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHD 355 

: I : : : I I I : I I I I : : : i I : I : : I I I : I I : I I I : I I III::: : I 

Db 475 VSPWRVAVRPKDNKELPKLVEGLKKLSKSDPLWCSKEETGEHIIAGCGELHVEICLQD 534 

Qy 356 LRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIMPLEKGLAEDIENE 415 

I : : I : : I : I : I I : I : : I I I I i : I : : : I I I I : : I I I I I I I : I I I 
Db 535 LQQEYAQIEIVASDPIVSYRETVVNLSNQTCLSKSPNKHNRLYMTAEPLPDGLTDDIEEG 594 

Qy 416 WQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDT 461 

I : I : I I : I II I I I I I : I I I I I : I I I 

Db 595 KVSPRDDPKERSNLLHDKYGFDKNAAMKIWCFGPETTGPNIMVDVT 64 0 



RESULT 10 




EF2_ 


_DROME 




ID 


EF2 DROME STANDARD; PRT; 843 AA. 




AC 


P13060; Q9I7H2; Q9V9R0; 




DT 


01-JAN-1990 (Rel. 13, Created) 




DT 


28-FEB-2003 (Rel. 41, Last sequence update) 




DT 


28-FEB-2003 (Rel. 41, Last annotation update) 




DE 


Elongation factor 2 (EF-2) . 




GN 


EF2B OR CG2238 . 




OS 


Drosophila melanogaster (Fruit fly) . 




OC 


Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; 


Pterygota ; 


OC 


Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 


OC 


Ephydroidea; Drosophilidae; Drosophila. 




OX 


NCBI TaxID=7227; 




RN 


[1] 




RP 


SEQUENCE FROM N . A. (ISOFORM LONG) . 




RX 


MEDLINE= 90016792; PubMed-2 508 059; 




RA 


Grinblat Y. , Brown N.H., Kafatos F.C.; 




RT 


"Isolation and character! zat ion of the Drosophila 


trans la tional 


RT 


elongation factor 2 gene."; 




RL 


Nucleic Acids Res. 17:7303-7 314(198 9). 




RN 


[2] 




RP 


SEQUENCE FROM N.A. (ISOFORMS LONG AND SHORT) . 




RC 


STRAIN=Berkeley; 




RX 


MEDLINE=2 01960 0 6; PubMed=107 31132 ; 




RA 


Adams M.D., Celniker S.E., Holt R. A. , Evans C.A. , 


Gocayne J . D . , 


RA 


Amanatides P.G., Scherer S.E., Li P.W., Hoskins R. 


A. , Galle R. F. , 


RA 


George R.A. , Lewis S.E., Richards S., Ashburner M. 


, Henderson S.N., 


RA 


Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q . , 


Chen L. X. , 


RA 


Brandon R.C., Rogers Y.-H.C, Blaze j R.G., Champe 


M. , Pfeiffer B.D. , 


RA 


Wan K.H., Doyle C, Baxter E.G., Helt G., Nelson C 


. R. , Miklos G. L. G. , 


RA 


Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 


RA 


Ballew R.M. , Basu A., Baxendale J., Bayraktaroglu 


L., Beasley E.M., 


RA 


Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., 


Bolshakov S . , 


RA 


Borkova D., Botchan M.R., Bouck J., Brokstein P., 


Brottier P . , 


RA 


Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 


RA 


Cherry J.M., Cawley S., Dahlke C, Davenport L.B., 


Davies P . , 


RA 


de Pablos B., Delcher A., Deng Z., Mays A.D., Dew 


I . , Dietz S .M. , 


RA 


Dodson K., Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P., 


RA 


Durbin K.J., Evangelista C.C., Ferraz C, Ferriera 


S., Fleischmann W., 


RA 


Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M 


. , Glasser K. , 


RA 


Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., 


Harris M . , 



RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F. , Karpen G.H., Ke Z . , Kennison J. A. , Ketchum K.A., 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z . , 

RA Lasko P., Lei Y. , Levitsky A. A. , Li J., Li Z., Liang Y . , Lin X., 

RA Liu X., Mattei B . , Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B . , Murphy L., Muzny D.M., Nelson D . L . , 

RA Nelson D.R., Nelson K.A. , Nixon K . , Nusskern D.R., Pacleb J.M., 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J. , Puri V., Reese M.G., 

RA Reinert K. , Remington K. , Saunders R.D.C., Scheeler F. , Shen H . , 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E . , Spradling A.C., Stapleton M. , Strong R., Sun E . , 

RA Svirskas R. , Tector C . , Turner R. , Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A. , Weinstock G.M. r Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G., Zhao Q . , Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W. , Zhou X . , Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A. , Myers E.W., Rubin G.M., Venter J.C. ; 

RT "The genome sequence of Drosophila melanogaster . " ; 

RL Science 2 87:2185-2195(2000). 

CC FUNCTION: THIS PROTEIN PROMOTES THE GTP-DEPENDENT TRANSLOCATION OF 

CC THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=Long; 

CC IsoId=Pl3060-l; Sequence=Displayed; 

CC Name=Short; 

CC IsoId=Pl3060-2; Sequence=VSP_0 01361 , VSP_001362; 

CC Note=No experimental confirmation available; 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 . 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC -!- CAUTION: Ref.2 sequence differs from that shown due to erroneous 
CC gene model prediction. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; X15805; CAA33804.1; -. 

DR EMBL; AE003781; AAF57226.1; ALT^SEQ. 

DR EMBL; AE003781; AAG22125.1; ALT_SEQ. 

DR PIR; S05988; S05988. 

DR FlyBase; FBgn0000559; Ef2b. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG C; 1. 



DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF0314 4; GTP_EFTU__D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTORJ3TP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation; 

KW Alternative splicing. 

FT INIT_MET 0 0 BY SIMILARITY. 

FT NP_BIND 25 32 GTP (BY SIMILARITY) . 

FT NP_BIND 107 111 GTP (BY SIMILARITY) . 

FT NP_BIND 161 164 GTP (BY SIMILARITY) . 

FT MOD_RES 56 56 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 58 58 PHOSPHORYLATION (BY SIMILARITY) . 

FT MOD_RES 700 700 DIPHTHAMIDE (BY SIMILARITY) . 

FT VARSPLIC 700 710 HRGGGQI I PTT -> TSKCSWLSGKA (in isoform 

FT Short) . 

FT / FT I d=VS P_0 0 1 3 6 1 . 

FT VARSPLIC 711 843 Missing (in isoform Short) . 

FT /FTId=VSP_001362 . 

FT CONFLICT 606 606 D -> E (IN REF. 1) . 

SQ SEQUENCE 843 AA; 94327 MW; C4BEE4 37F2 3E5593 CRC64; 

Query Match 33.4%; Score 844.5; DB 1; Length 843; 

Best Local Similarity 37.2%; Pred. No. 2.8e-54; 

Matches 175; Conservative 103; Mismatches 183; Indels 9; Gaps 4 

Qy 12 LSPLLGNVCFSSSQYSICFTLGSFAKIYADTFGDINYQEFAKRLWGDIYFNPKTRKFTKK 71 

: I I : I I I : III I : : : I : : I I : : I I I i : : I I I i : I : : I 
Db 207 VDPSKGSVGFGSGLHGWAFTLKQFSEMYSEKF-KIDWKLMNRLWGENFFNAKTKKW-QK 264 

Qy 72 APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRLV 131 

: : : I I I : I i : I : I I : : : : I : : : I : I I : : : I I : I 

Db 2 65 QKEADNKRSFCMYILDPI YKVFDAIMNYKKEEIGTLLEKIGVTLKHEDKDKDGKALLKTV 32 4 

Qy 132 CKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMCHTT 191 

: : : i I : I I I I I : : I II I : I : I I I I I I I I : : 

Db 325 MRTWLPAGEALLQMIAIHLPSPWAQKYRMEMLYEGPHDDEAAIAVKSCDPDGPLMMYIS 384 

Qy 192 KMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVARYHI 251 

II II : I : I I I | : | : I j : : : I I I I : I I : | : : I i 

Db 385 KMVPTSDKGRFYAFGRVFAGKVATGQKCRIMGPNYTPGKKEDLYEKAIQRTILMMGRYVE 444 

Qy 252 EVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTTSVIKIAVEPVNPSE 311 

: I I : I I : I I I I : I I I I I I : : I : : I I : : I : : : II I I I I : : 

Db 445 AI E DVP S GN I CGL VGVDQ FLVKT GT ITT FKDAHNMKVMKFSVSPVVRVAVEPKN PAD 501 

Qy 312 LPKMLDGLRKA/NKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKV7VDPV 371 

I M : : : I I : : : I I I : : I I i I I I : I I I I I : I : : I I : : : I : I : I i I 
Db 502 LPKLVEGLKRLAKSDPMVQCIIEESGEHIIAGAGELHLEICLKDLEEDHACIPLKKSDPV 561 

Qy 372 VTFCETWETSSLKCFAETPNKKNKITMI7\EPLEKGLAEDIENEVVQITWNRKKLGEFFQ 431 

I : : I I I I I j : : : | M I : : I I I : I I I I I : I I I : 

Db 5 62 VS YRET VSEESDQMCLSKS PNKHNRLLMKALPMPDGLPEDI DNGDVSAKDEFKARARYLS 621 

Qy 432 TKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDSIVQG 481 

I I I : I : I I 1 I I I I I I I j I :: I I : I : I I I : I I 



Db 622 EKYDYDVT EARKIWCFGPDGTGPNFILDCTKSVQ YLNEIKDSWAG 667 

RESULT 11 
EF2__CANAL 

ID EF2_CANAL STANDARD; PRT ; 842 AA. 

AC 013430; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

GN EFT2 . 

OS Candida albicans (Yeast) . 

OC Eukaryota; Fungi; As comycota ; Saccharomycotina ; Saccharomycetes ; 

OC Saccharomyce tales ; mitosporic Saccharomycetales ; Candida. 

OX NCBI_TaxID-54 7 6 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=ATCC 102 61; 

RA Capa L., Mendoza A., Serramia M.J., Garcia-Bus tos J.F.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 

CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME (BY SIMILARITY) . 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF— 2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y09664; CAA70857.2; -. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP__EFTU ; 1. 

DR Pfam; PF0314 4; GTP_EFTU__D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT. 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; Protein biosynthesis; GTP-binding. 

FT NP_BIND 2 6 33 GTP (BY SIMILARITY) . 

FT NP_BIND 104 108 GTP (BY SIMILARITY) . 

FT NP_BIND 158 161 GTP (BY SIMILARITY) . 

FT MOD_RES 699 699 DIPHTHAMIDE (BY SIMILARITY) . 

SQ SEQUENCE 842 AA; 93354 MW; 5C8C74 0109A2318 9 CRC64; 



Query Match 



33.2%; Score 839; DB 1; Length 842; 



Best Local Similarity 35.6%; Pred. No. 7.1e-54; 

Matches 174; Conservative 116; Mismatches 171; Indels 28; Gaps 6 

Qy 9 NLILS PLLGNVCFSS SQYS ICFTLGS FAKI YADTFGDINYQEFAKRL 55 

I : I : I I I I I : I : I I : I I I : I I : : : : : I I 

Db 186 NVIISTYCDPVLGDVQVYPQKGTVAFASGLHGWAFTVRQFANKYSKKFG-VDKEKMMERL 244 

Qy 56 WGDI YFNPKTRKFTKK APTSSSQRS FVEFILEPLYKILAQWGDVDTSLPRTLDELG 112 

I I I i I I I I I : I : I I I : I : I I I f : I : : : : I : : : I | : : | 

Db 245 WGDSYFNPKTKKWTNKDKDADGKPLERAFNMFILDPIFRLFAAIMNFKKDEIPVLLEKLE 304 

Qy 113 IHLTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSD 172 

II : I I : I I : : I : I I : : | I 1 : i | | : | | | | 

Db 3 05 IQLKGDEKDLEGKALLKVVMRKFLPAADALLEMIVLHLPSPVTAQAYRAETLYEGPSDDP 3 64 

Qy 173 LGEAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEE 232 

I : : I I I : I I : : I I I I : I : I I I I : I I : : I I I : : I I I : : I 
Db 365 FCTAI RNCDPNADLML YVS KMVPT S DKGRF YAFGRVFAGTVKS GQKVRI QGPNYQVGKKE 424 

Qy 233 DSQICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPL 292 

1 : : : f : : I : : : I I I I : : I : I I : : I : I I I III : : 

Db 425 DLFLKSIQRTVLMMGRSVEQIDDCPAGNIIGLVGIDQFLLKSGT1TT NEAAHNMKVM 481 

Qy 2 93 KFNTTSVIKIAVEPVNPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCV 352 

II: : I : : : I I I I : : I I I : : : I I : : : : I I I : I : Mill:: | | I I I : I : 
Db 482 KFSVS PVVQVAVEVKNANDLPKLVEGLKRLS KS DPCVLT YMSESGEHI VAGTGELHLEI C 541 

Qy 353 MHDLRKMYSEIDIKVADPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDI 412 

: I I ::::::: | | | :: | M || : : : : i M I : I : I : I : : : : : I I 
Db 542 LQDLENDHAGVPLRISPPVVS YRETVEGESSMVALSKSPNKHNRI YVP<AQPIDEEVSLDI 601 

Qy 413 ENEVVQITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — 470 

MM : I I : II :: II I I I I I I I I I : : M I I I : 

Db 602 ENGVINPRDDFKARARILADKHGWDWDARKIWCFGPDGNGPNLWDQT KAVQY 655 

Qy 471 LGSVKDSIV 479 

I : I I i : I 

Db 656 LNEIKDSW 664 



RESULT 12 
EF2_CAEEL 

ID EF2_CAEEL STANDARD; PRT; 851 AA. 

AC P29691; 017837; 

DT 01-APR-1993 (Rel. 25, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

GN EFT-2 OR F25H5.4. 

OS Caenorhabditis elegans . 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID==6239; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=92029622; PubMed=l930695 ; 

RA Ofulue E.N., Candido E.P.M.; 



RT "Molecular cloning and characterization of the Caenorhabditis elegans 

RT elongation factor 2 gene (eft-2) . " ; 

RL DNA Cell Biol. 10:603-611(1991). 

RN [2] 

RP SEQUENCE FROM N . A. 

RC STRAIN=Bris tol N2 ; 

RA Steward C . ; 

RL Submitted (OCT-1996) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M86959; AAD03339.1; -. 

DR EMBL; Z81068; CAB02985.1; -. 

DR PIR; A40411; A40411. 

DR PIR; T21362; T21362. 

DR WormPep; F25H5.4; CE15900. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF000 09; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 



FT 


INI T_MET 


0 


0 




BY SIMILARITY. 


FT 


NP_BIND 


25 


32 




GTP {BY SIMILARITY) . 


FT 


NP_BIND 


115 


119 




GTP (BY SIMILARITY) . 


FT 


NP_BIND 


169 


172 




GTP (BY SIMILARITY) . 


FT 


MOD RES 


56 


56 




PHOSPHORYLATION (BY SIMILARITY) 


FT 


M0D_RES 


58 


58 




PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


708 


708 




DI PHTHAMIDE (BY SIMILARITY) . 


FT 


CONFLICT 


131 


131 




R -> G (IN REF . 1) . 


FT 


CONFLICT 


616 


616 




N -> S (IN REF . 1) . 


FT 


CONFLICT 


627 


62 9 




I LA -> YPG {IN REF. 1) . 


FT 


CONFLICT 


635 


635 




D -> A (IN REF. 1) . 


SQ 


SEQUENCE 


851 AA; 


94665 


MW; 2B2460CEB6399176 CRC64; 


Query Match 




33. 


1 £ 


h; Score 837.5; DB 1; Length 851 


Best Local 


Similarity 


36. 


2 s 


h; Pred. No. 9.3e-54; 



Matches 175; Conservative 103; Mismatches 193; Indels 13; Gaps 5; 



Qy 2 AYYSTDEN LILSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWG 57 

II I : : : : | : i M | I : I I I : : I I I I : : I j | | 

Db 201 ATYGDDDGPMGPIMVDPSIGNVGFGSGLHGWAFTLKQFAEMYAGKFG-VQVDKLMKNLWG 259 

Qy 58 DIYFNPKTRKFTKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTK 117 



Db 260 DRFFDLKTKKWS-STQTDESKRGFCQFVLDPIFMVFDAVMNIKKDKTAALVEKLGIKLAN 318 

Qy 118 EELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAM 177 

: I I : | | : : : : I : : I I : I I I I : : I | | | : | : 

Db 319 DEKDLEGKPLMKVFMRKWLPAGDTMLQMIAFHLPSPVTAQKYRMEMLYEGPHDDEAAVAI 378 

Qy 178 SDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQIC 237 

I I I : I I I I : : I I II : I : I I I I I I : I :: I II : I I 

Db 379 KTCDPNGPLMMYISKMVPTSDKGRFYAFGRVFSGKVATGMKARIQGPNYVPGKKEDLYEK 438 

Qy 238 TVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKFNTT 297 

I : I : : I : : : I : I I : I I I I : II IN : : t I : I I : : 

Db 439 T I QRT I LMMGRFI EP I EDI P S GN I AGLVGVDQ YLVKGGT I TT YKDAHNMRVMKFSVS 495 

Qy 298 SVIKIAVEPVNPSELPKMLDGLRKVNKS YPSLTTKVEESGEHVILGTGELYLDCVMHDLR 357 

I : : : I I I II : : I I I : : : I I : : : II I : I I II I I : I I I I I : I : : i I 

Db 496 PWRVAVEAKNPADLPKLVEGLKRLAKSDPMVQCIFEESGEHIIAGAGELHLEICLKDLE 555 

Qy 358 KMYSEIDIKVADPVVTFCETWETSSLKCF7VETPNKKNKITMIAEPLEKGLAEDIENEW 417 

: : : I : I : I I I I : : I I I I : I : : : I M I : : I : I : 111 = 111 I 
Db 556 EDHACIPLKKSDPWSYRETVQSESNQICLSKSPNKHNRLHCTAQPMPDGLADDIEGGTV 615 

Qy 418 QITWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKALLGSVKDS 47 7 

I : I I : : I : II I I I I I I I I I I : I : I I : I : I I I 

Db 616 NARDEFKARAKILAEKYEYDVTEARKIWCFGPDGTGPNLLMDVTKGVQ YLNEIKDS 671 

Qy 478 IVQG 481 

: I I 

Db 672 WAG 675 



RESULT 13 
EF2_ENTHI 

ID EF2_ENTHI STANDARD; PRT ; 84 0 AA. 

AC Q06193; 

DT 01-FEB-1994 (Rel. 28, Created) 

DT 01-FEB-1994 (Rel. 2 8, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) , 

GN EF-2 . 

OS Entamoeba histolytica. 

OC Eukaryota; Entamoebidae ; Entamoeba. 

OX NCBI_TaxID=5759; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SFL-3; 

RX MEDLINE=93135 8 31; PubMed-8 422275 ; 

RA Plaimauer B., Ortner S., Wiedemann G. , Scheiner O., Duchene M, ; 

RT "Molecular characterization of the cDNA coding for translation 

RT elongation factor-2 of pathogenic Entamoeba histolytica."; 

RL DNA Cell Biol. 12:89-96(1993). 



RN [2] 

RP SEQUENCE OF 30-790 FROM N.A. 

RC STRAIN=HK- 9 ; 

RX MEDLINE=94355 0 65; PubMed=8 074 8 87 ; 

RA Shirakura ■ T . , Hashimoto T., Nakamura Y., Kamaishi T., Cao Y., 

RA Adachi J., Hasegawa M. , Yamamoto A. , Goto N.; 

RT "Phylogenetic place of a mitochondria-lacking protozoan, Entamoeba 

RT histolytica, inferred from amino acid sequences of elongation factor 

RT 2 1 . " ; 

RL Jpn. J. Genet. 69:119-135(1994). 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 . 

CC -!- SIMILARITY: BELONGS TO THE GTP -BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; L02417; AAA29097.1; -. 

DR EMBL; D21259; BAA04 8 00.1; -. 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF037 64; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2 ; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs ; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation. 



FT 


NP_BIND 


26 


33 


GTP (BY SIMILARITY) . 


FT 


NP BIND 


102 


106 


GTP (BY SIMILARITY) . 


FT 


NP_BIND 


156 


159 


GTP (BY SIMILARITY) . 


FT 


MOD RES 


57 


57 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


59 


59 


PHOSPHORYLATION (BY SIMILARITY) 


FT 


MOD_RES 


697 


697 


DIPHTHAMIDE (BY SIMILARITY) . 


FT 


CONFLICT 


471 


471 


V -> DT (IN REF. 2) . 


FT 


CONFLICT 


541 


541 


E -> D (IN REF. 2) . 


FT 


CONFLICT 


592 


592 


P -> A (IN REF. 2) . 


SQ 


SEQUENCE 


840 AA; 


93324 


MW; 2E705327E3B52210 CRC64; 



Query Match 32.6%; Score 823.5; DB 1; Length 840; 

Best Local Similarity 36.3%; Pred. No. 9.8e-53; 

Matches 173; Conservative 104; Mismatches 182; Indels 17; Gaps 6; 



Qy 



12 LSPLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK 71 



• I I I I I 1 • III I I I • • • II I * . * • I I I I I * * I . I . I 

Db 2 00 VSPGEGTVAFGSGLHGWAFTLEKFAKMWS7VKFG-IDRKRMLEKLWGDNYWDAKAKKWKKN 2 58 

Qy 72 APTSSS QRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLL 12 8 

II I I : I : I : I : : : : I I I I : : : : : I I 

Db 259 GKGDHGEVLQRGFVQFCFDPITKLFNAIMEGRKADYEKMLTNLQIKLSADDKEKEGKELL 318 

Qy 129 RLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLGEAMSDCDPDGPLMC 188 

: I I : : : I I I : I I I I | : : I I I : I : : I I : : I I I I I I 

Db 319 KTVMKLWLPAGVTLLEMIVLHLPSPWAQKYRTSNLYTGPMDDEAAKAMANCDEKGPLMM 37 8 

Qy 189 HTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDSQICTVGRLWISVAR 248 

: : I I I : I : I : I I I I I I M i : : I I I : : I I : | : : | 

Db 379 YVSKMIPTNDKGRFYAFGRVFSGTIRTGGKARICGPNYVPGKKDDCVIKNIQRTMLMMGR 438 

Qy 249 YH I E VN RVP AGNWVL I E GVDQ P I VKT AT I T E P RGN E EAQ I FRP L K FNT T S VI K I AVE P VN 308 

I : : I I I : : I I I I : : i : I I I : I I : : I I : : I : : : I I I I 

Db 439 YTDPIDECPCGNVIGLVGVDQYLLKSGTITD SVAHI I KDMKFSVS PWRVAVETKN 4 94 

Qy 309 PSELPKMLDGLRKWKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMY-SEIDIKV 367 

I I : I I I ::: I :::::: I I i I I I I I : : I I M : I : : : | : : | | : : | 

Db 495 PSDLPKLVEGMKRLSRSDPLCLCYTEESGEHIVAGAGELHLEVCLKELQEDYCSGVPLIV 554 

Qy 368 ADPWTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLG 427 

: I I I : I I I : I ! : : i : : : I : I : : I i I : I I E I I I i : : : I : 
Db 555 TEPWSFRETITEPSRIQCLSKSANNQNRLFMRAFPFPEGLAEDIEAGEIKPDTDFKERA 614 

Qy 428 EFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDS IVQG 481 

: I MM: M II MM MM II I I : I I j M I I I 

Db 615 KFLSEKYGWDVDEARKIWCFGPDNCGPKLFVD VTKGIQYLNEVKDS IVNG 664 



RESULT 14 
EF2_BETVU 

ID EF2_BETVU STANDARD; PRT; 84 3 AA. 

AC 023755; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Elongation factor 2 (EF-2) . 

OS Beta vulgaris (Sugar beet) . 

OC Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta; 

OC Spermatophyta ; Magnoliophyta ; eudicotyledons ; core eudicots ; 

OC Caryophyllidae; Caryophyllales ; Chenopodiaceae ; Beta. 

OX NCBI_TaxID=161934; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Root; 

RA Vogel R. , Rausch T.; 

RL Submitted (JUN-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- PTM: PHOSPHORYLATION BY EF-2 KINASE COMPLETELY INACTIVATES EF-2 
CC (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE GTP-BINDING ELONGATION FACTOR FAMILY. 



CC EF-G/EF-2 SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Z97178; CAB09900.1; -. 

DR PIR; T14579; T14579. 

DR InterPro; IPR000795; EF^GTPbind. 

DR InterPro; IPR000640; EFG_C. 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP~EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongatic 

FT NP_BIND 

FT NP_BIND 

FT NP_BIND 

FT MOD_RES 

FT MOD_RES 

FT MOD_RES 

SQ SEQUENCE 

Query Match 

Best Local Similarity 35.7%; Pred. No. 3.2e-52; 

Matches 175; Conservative 111; Mismatches 179; Indels 25; Gaps 8; 

Qy 9 NLILS PLLGN VCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRL 55 

1 = 1:: MM: I ih : Ml : I I I : I I I I : : : Ml 

Db 18 6 NVIMATYEDPLLGDVQVYPEKGTVAFSAGLHGWAFTLSNFAKMYASKFG-VDESKMMERL 2 44 

Qy 56 WGDIYFNPKTRKF-TKKAPTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIH 114 

M : : M I M M M : : I : I MM I M MM : I I : M M 

Db 245 WGENFFDPATKKWTTKNS GNAS CKRGFVQFCYEPI KQI I AACMNDQKDKLLAHVTKLGI Q 304 

Qy 115 LTKEELKLNIRPLLRLVCKKFFGEFTGFVDMCVQHIPSPKVGAKPKIEHTYTGGVDSDLG 174 

: M I MM: | : : : : : j : | : | | | : : ; | : | | : | 

Db 305 MKTEEKDLMGRPLMKRVMQTWLPAS SALLEMMIHHLPSPATAQRYRVENLYEGPMDDVYA 364 

Qy 175 EAMSDCDPDGPLMCHTTKMFSTHDGVQFHPFGRVLSGTIHAGQPVKVLGENYTLEDEEDS 234 

M : M M M M : Ml I : I MM M : I I : : : I I I : : : I 

Db 365 TAIRNCDPEGPLMLYVSKMIPASDKGRFFAFGRVFAGKVSTGMKVRIMGPNYVPGEKKDL 424 

Qy 235 QICTVGRLWISVARYHIEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQIFRPLKF 294 

: II I : : I M II I : I : I i ! I I I : I : : I I : M 

Db 42 5 YVKNVQRTVIWMGKKQETVEDVPCGNTVALVGLDQYITKNATLTNEK-ESDAHPIRAMKF 4 83 

Qy 2 95 NTTSVIKIAVEPWPSELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMH 354 

: : I : : : II : I M I I : : M M : : I I I : : I I M I I : I I II I : I : : 



factor; 


GTP- 


binding; Protein biosynthesis; Phosphorylation 


26 


33 




GTP {BY SIMILARITY) . 


104 


108 




GTP (BY SIMILARITY) . 


158 


161 




GTP (BY SIMILARITY) . 


57 


57 




PHOSPHORYLATION (BY SIMILARITY) . 


59 


59 




PHOSPHORYLATION (BY SIMILARITY) . 


700 


700 




DIPHTHAMIDE (BY SIMILARITY) . 


843 AA; 


93799 


MW; D11325B42A51F4F8 CRC64; 




32 


. 3h 


Score 816.5; DB 1; Length 843; 



Db 



4 84 SVS P WRVAVQCKVASDLPKLVEGLKRLAKS DPMVVCS I EESGEHI IAGAGELHLEI CLK 54 3 



Qy 355 DLRKMY-SEIDIKVMPVVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGIAEDIE 413 

! I : : : I : I I I I : I I I I : : I : : : I I I I : : I f I : I : I I I I I : 

Db 544 DLQDDFMGGAEIIKSDPWSFRETVLDRSVRTVMSKSPNKHNRLYMEARPMEEGLAEAID 603 

Qy 414 NEWQ1TWNRKKLGEFFQTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL L 471 

: : I : : I I I I : I I I I I : I I I I : : I I : I : j 

Db 604 EGRIGPRDDPKNRSKI1AEEYGWDKDLAKKIWCFGPETTGPNMVVD MCKGVQYL 657 

Qy 472 GSVKDSIVQG 481 

: I 1 I : 1 I 
Db 658 NEIKDSWAG 667 



RESULT 15 
EF2_ YEAST 

ID EF2_ YEAST STANDARD; PRT ; 842 AA. 

AC P32324; 

DT 01-OCT-1993 (Rel. 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Elongation factor 2 (EF-2). 

GN (EFT1 OR YOR133W OR 03317 OR YOR3317W) AND (EFT2 OR YDR385W) . 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; As comycota ; Saccharomycotina ; Saccharomycetes; 

OC Saccharomycetales ; Saccharomycetaceae ; Saccharomyces. 

OX NCBI_TaxID=4932; 

RN [1] 

RP SEQUENCE FROM N.A. (EFTl AND EFT2 ) . 

RX MEDLINE=921127 60; PubMed=1730643 ; 

RA Perentesis J. P., Phan L.D., Laporte D.C., Livingston D.M., 

RA Bodley J. W. ; 

RT "Saccharomyces cerevisiae elongation factor 2. Genetic cloning, 

RT characterization of expression, and G-domain modeling. "; 

RL J. Biol. Chem. 2 67:1190-1197(1992). 

RN [2] 

RP SEQUENCE FROM N.A. (EFTl) . 

RX MEDLINE=97 34 436 8; PubMed=92 0 0 8 15 ; 

RA Voss H., Benes V., Andrade M.A. , Valencia A., Rechmann S., Teodoru C. 

RA Schwager C, Paces V., Sander C, Ansorge W. ; 

RT "DNA sequencing and analysis of 130 kb from yeast chromosome XV."; 

RL Yeast 13:655-672(19 97). 

RN [3] 

RP SEQUENCE FROM N.A. (EFTl) . 

RC STRAiN=S288c / FY1679; 

RX MEDLINE=9 7 0 6002 0; PubMed=8 904341 ; 

RA Wiemann S., Rechmann S., Benes V., Voss H., Schwager C, Vlcek C, 

RA Stegemann J., Zimmermann J., Erfle H., Paces V., Ansorge W. ; 

RT "Sequencing and analysis of 51 kb on the right arm of chromosome XV 

RT from Saccharomyces cerevisiae reveals 30 open reading frames.* 1 ; 

RL Yeast 12:281-288(1996). 

RN [4] 

RP SEQUENCE FROM N.A. (EFT2) . 

RA Dietrich F.S., Mulligan J., Allen E . , Araujo R. , Aviles E., Berno A., 

RA Carpenter J., Chen E., Cherry J.M. , Chung E . , Duncan M . , 

RA Hunicke-Smith S., Hyman R. , Komp C, Lashkari D., Lew H., Lin D., 



RA Mosedale D . , Nakahara K., Namath A., Oefner P., Oh C, Petel F.X., 

RA Roberts D., Schramm S., Schroeder M . / Shogren T., Shroff N . , 

RA Winant A., Yelton M. , Botstein D., Davis R.W.; 

RL Submitted (JUL-1995) to the EMBL/ GenBank/DDB J databases. 

RN [5] 

RP SEQUENCE OF 411-422 AND 505-513. 

RC STRAIN=S288c; 

RX MEDLINE=95203288; PubMed=7 8 95733 ; 

RA Garrels J.I., Futcher B., Kobayashi R. , Latter G.I., Schwender B . , 

RA Volpe T., Warner J.R., McLaughlin C.S.; 

RT "Protein identifications for a Saccharomyces cerevisiae protein 

RT database."; 

RL Electrophoresis 15:14 66-14 8 6(1994) . 

CC -!- FUNCTION: THIS PROTEIN PROMOTES THE GTP- DEPENDENT TRANSLOCATION 
CC OF THE NASCENT PROTEIN CHAIN FROM THE A-SITE TO THE P-SITE OF THE 

CC RIBOSOME. 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- SIMILARITY: BELONGS TO THE GTP- BINDING ELONGATION FACTOR FAMILY. 
CC EF-G/EF— 2 SUBFAMILY. 

cc — 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch ) . 

CC 

DR EMBL; M59369; AAA21646.1; -. 

DR EMBL; M59370; AAA51398.1; -. 

DR EMBL; X90518; CAA62116.1; 

DR EMBL; X94335; CAA64052.1; -. 

DR EMBL; Z75041; CAA99332.1; -. 

DR EMBL; U32274; AAB64827.1; -. 

DR EMBL; U28373; AAB64821.1; -. 

DR PIR; A41778; A41778. 

DR PDB; 1N0U; ll-FEB-03. 

DR SWISS-2DPAGE; P32324; YEAST. 

DR SGD; S0005659; EFT1. 

DR SGD; S0002793; EFT2 . 

DR InterPro; IPR000795; EF_GTPbind. 

DR InterPro; IPR000640; EFG_C . 

DR InterPro; IPR005517; EFG_IV. 

DR InterPro; IPR004161; EFTU_D2 . 

DR InterPro; IPR005225; Small_GTP. 

DR Pfam; PF00679; EFG_C; 1. 

DR Pfam; PF03764; EFG_IV; 1. 

DR Pfam; PF00009; GTP_EFTU; 1. 

DR Pfam; PF03144; GTP_EFTU_D2; 1. 

DR PRINTS; PR00315; ELONGATNFCT . 

DR TIGRFAMs; TIGR00231; small_GTP; 1. 

DR PROSITE; PS00301; EFACTOR_GTP; 1. 

KW Elongation factor; GTP-binding; Protein biosynthesis; Phosphorylation; 

KW 3D-structure . 

FT NP_BIND 26 33 GTP (BY SIMILARITY) . 

FT NP_BIND 104 108 GTP (BY SIMILARITY) . 

FT NP BIND 158 161 GTP (BY SIMILARITY) . 



FT MOD_RES 699 699 DIPHTHAMIDE (BY SIMILARITY) . 

SQ SEQUENCE 842 AA; 93289 MW; FD2F8073CB9B66AA CRC64; 

Query Match 32.0%; Score 807.5; DB 1; Length 842; 

Best Local Similarity 35.7%; Pred. No. 1.5e-51; 

Matches 168; Conservative 106; Mismatches 182; Indels 15; Gaps 5 

Qy 14 PLLGNVCFSSSQYSICFTLGSFAKI YADTFGDINYQEFAKRLWGDI YFNPKTRKFTKK — 71 

I I I I I : II: II II II:: : I I I I I : I I I I I : I : I I 
Db 2 04 PARGTVAFGSGLHGWAFTIRQFATRYAKKFG-VDKAKMMDRLWGDSFFNPKTKKWTNKDT 2 62 

Qy 72 -APTSSSQRSFVEFILEPLYKILAQWGDVDTSLPRTLDELGIHLTKEELKLNIRPLLRL 130 

I : I : i | i i : | : : : : : : : I I : : I I I : | | : I I : : 

Db 263 DAEGKPLERAFNMFILDPIFRLFTAIMNFKKDEIPVLLEKLEIVLKGDEKDLEGKALLKV 322 

Qy 131 VCKKFFGEFTGFVDMCVQHI P S PKVGAKPKI EHTYTGGVDS DLGEAMS DCDPDGPLMCHT 190 

I : I I : : I I I : I I I : I II I I : : I j I I I : 

Db 323 VMRKFLPAADALLEMIVLHLPSPVTAQAYRAEQLYEGPADDANCIAIKNCDPKADLMLYV 382 

Qy 191 TKMFSTHDGVQFHPFGRVLSGTIHAGQ PVKVLGENYTLEDEEDSQICTVGRLWISVARYH 250 

: I I II : I : i I M : I I : : I I I : : I I I : : I I : I : : : I : 

Db 383 S KMVPT S DKGRFYAFGRVFAGTVKS GQKVRI QGPN YVP GKKDDLFI KAI QRWLMMGRFV 442 

Qy 251 IEVNRVPAGNWVLIEGVDQPIVKTATITEPRGNEEAQI FRPLKFNTTSVIKIAVEPVNPS 310 

: : | | | I : : | : | | : : M I : I : I i : : | | : : | : : : || I I : 

Db 443 EP I DDCPAGNI I GLVGI DQFLLKTGTLTT SETAHNMKVMKFSVSPWQVAVEVKNAN 499 

Qy 311 ELPKMLDGLRKVNKSYPSLTTKVEESGEHVILGTGELYLDCVMHDLRKMYSEIDIKVADP 370 

: I I I : : : I I : : : : I I I : I : Mill:: [| I I I : I : : II : : : M : : I 
Db 500 DLPKLVEGLKRLSKSDPCVLTYMSESGEHIVAGTGELHLEICLQDLEHDHAGVPLKISPP 559 

Qy 371 VVTFCETWETSSLKCFAETPNKKNKITMIAEPLEKGLAEDIENEWQITWNRKKLGEFF 430 

I I : M I II : : : | | | | : | : | | I : : : : : III:: : I 

Db 560 WAYRETVESESSQTALSKSPNKHNRI YLKAEPIDEEVS LAI ENGI INPRDDFKARARIM 619 

Qy 431 QTKYDWDLLAARSIWAFGPDATGPNILVDDTLPSEVDKAL — LGSVKDSIV 479 

III: I I I I I I I I I I I : : : I I I I : I : I I I : I 

Db 620 ADDYGWDVTDARKIWCFGPDGNGPNLVIDQT KAVQYLHEIKDSW 664 



Search completed: January 30, 2004, 11:25:06 
Job time : 19.716 sees 



